METHOD FOR THE PREPARATION OF TRANSGENIC PLANTS
CHARACTERISED BY GEMINIVIRUS LASTING RESISTANCE



The present invention relates to a method for the preparation of
transgenic plants with lasting resistance to geminiviruses.

More particularly, the invention concerns a method for the
preparation of transgenic plants with lasting resistance to geminiviruses,
wherein the transgene consists of a polynucleotide sequence, derived from
the pathogen, suitably modified so as to be an ineffective target of the
post-transcriptional gene silencing induced by geminiviruses.

Geminiviruses are known to be a large and diverse class of
plant viruses that infect several plants of agronomic interest, causing
serious harvest losses. Such viruses are characterised by virions
consisting of two geminate icosahedral particles. Their genome, consisting
of one or two circular single-stranded DNA (ssDNA) molecules, replicates
in the nucleus of infected cells through double-stranded intermediates
(Hanley-Bowdoin et al., 1999).

The Geminiviridae family is divided into four genera, named
Mastrevirus, Begomovirus, Curtovirus and Topocuvirus, based on the
insect vector, the host spectrum and the genome structure (Briddon et al.,
1985; Fauquet et al., 2003).

A serious disease of the tomato plant, transmitted by the whitefly
Bemisia tabaci, has long been known as "tomato yellow leaf curl" in
the Middle East, South-East Asia and Africa (Czosnek et al., 1997).
This disease, which can cause harvest losses of 100% (Pico et al.,
1996; Czosnek et al., 1997), subsequently spread both throughout the
Western Mediterranean, reaching Sardinia, Sicily and Spain (Czosnek et
al., 1997), and to America (Polston et al., 1997).

Recently the agents of the disease have been identified and
isolated; they are viruses belonging to the Geminiviridae family, genus
Begomovirus. Phylogenetic studies have highlighted the presence of
different viral species related to the different geographical origins of the
Begomoviruses: Asia, Africa and America (Czosnek et al., 1997).

The genome of the Tomato yellow leaf curl Sardinia virus (TYLCSV)
species is monopartite (Kheyr-Pour et al., 1991). The DNA is transcribed
bidirectionally and contains six open reading frames (ORFs), two on the
viral strand (V): V1 and V2, and four on the complementary strand (C):
C1, C2, C3 and C4, as shown in figure 1. Between the C1 and V2 ORFs
there is a non-coding region named intergenic region (IR), analogous to
that present in the genome of all Geminiviridae. The genomic organization
of TYLCSV is structurally similar to that of component A of bipartite
Begomoviruses such as the tomato golden mosaic virus (TGMV) and the
African cassava mosaic virus (ACMV). In the case of bipartite
Begomoviruses, the nomenclature of the ORFs present on the
complementary strand of component A is AL1 or AC1, AL2 or AC2, AL3 or
AC3, AL4 or AC4, and on the viral strand AR1 or AV1, AR2 or AV2; on the
complementary strand of component B it is BL1 or BC1, and on the viral
strand BR1 or BV1.

The strategies used until now to control infection by the
geminiviruses transmitted by Bemisia tabaci are based on the use of
expensive fine mesh nets (for the cultivation of fresh-market tomato) and
particularly on repeated insecticide treatments (cultivation of both
fresh-market and processing tomato). Such strategies increase
production costs and represent a serious danger to the health of
agricultural workers and consumers. Furthermore, the onset of
Bemisia tabaci populations resistant to the insecticide imidacloprid has
already been reported (Cahill et al., 1996; Williams et al., 1996).

The development of resistant cultivated species represents the
most practical and economic way to control viral infections. Classical
breeding programs for introducing resistance to the geminiviruses that
cause tomato yellow leaf curl were based on the transfer of resistance
genes from wild species of Lycopersicon to cultivated tomato. In this way
lines with variable levels of resistance to TYLCSV have been obtained and
commercialized, the best lines showing reduced symptoms and low viral
replication. However, plants with low or intermediate levels of resistance
represent a potential reservoir for further infections.

Another important aspect to be considered is that the agronomic
characteristics of the lines obtained are not always optimal and in any
case reflect those of the cultivated tomato genotype used in the breeding
programs.

A tomato line immune to the viruses causing the tomato yellow leaf
curl disease, namely with neither symptoms nor viral DNA replication, has
not been released yet.

With the advent of genetic engineering, new perspectives were
opened up for the introduction of resistance traits against plant
viruses. Most strategies are based on the introduction and expression of
pathogen-derived sequences in the plant of interest, known as Pathogen
Derived Resistance (PDR) (Sanford & Johnson, 1985; Abel et al., 1986;
Tavazza and Lucioli, 1993).

Although such strategies have been successfully applied for the
introduction of resistance traits against plant viruses with an RNA genome
(Beachy, 1997), in the case of geminiviruses, which have a DNA genome,
the expression of pathogen-derived sequences has produced plants with no
lasting resistance and/or tolerance.

The mechanisms that induce virus resistance through the
expression of pathogen-derived sequences can be grouped into two broad
classes:

a) resistance mediated by the expression of a pathogen protein
such as, for instance, a dominant negative mutant;

b) resistance mediated by post-transcriptional gene
silencing (Baulcombe, 1996; Beachy, 1997; Zaitlin and Palukaitis, 2000).

Post-transcriptional gene silencing is a ubiquitous process in
eukaryotes, involving the degradation of specific RNAs following the
formation of double-stranded RNA (dsRNA) molecules with sequences
homologous to the target RNA.

Although different contexts can induce the production of dsRNA
homologous to the transgene (transcription of aberrant transgenic RNAs,
presence in the transgenic RNA of sufficiently long inverted repeat
sequences, integration of the transgene in the plant genome as multiple
inverted repeat copies), once the dsRNA is produced it is recognised and
degraded into short dsRNA molecules of about 21-26 nucleotides, referred
to as siRNAs.

The siRNAs are then incorporated into a multiprotein complex
named RISC, which is able to degrade all RNAs having sequence homology
with the siRNAs. The siRNAs are therefore the determinants of RNA
silencing specificity, and their presence for a given sequence establishes
unequivocally that this RNA sequence is post-transcriptionally silenced.

Therefore, transgenic plants post-transcriptionally silenced for
sequences derived from a viral RNA genome are resistant to the
homologous virus and to viruses with nucleotide sequences closely related
to the transgene.

Transgene silencing can also be induced following virus
infection.

In fact, viral replication is able to induce silencing of a
transgene, initially not silenced, if the nucleotide sequence of the
transgene is homologous to a portion of the infecting virus genome. The
activation of the silencing mechanism involves the specific degradation of
the RNA molecules having sequence homology with the inducer RNA.
As a direct consequence, silencing activation by the virus is
associated with degradation of both the transgenic mRNA sequences
homologous to the virus and the viral genome. This results in recovery of
the host after an initial infectious step, so that the new vegetative growth
proves to be virus-free. A peculiar characteristic of the plant tissues that
develop after the recovery phenomenon is that they are highly
resistant to a subsequent infection by the same virus.

The resistance mediated by post-transcriptional gene silencing,
being based on recognition at the nucleotide level, is effective
only against viral isolates closely homologous to the virus genome from
which the transgene was derived. By contrast, strategies based on the
expression of a pathogen protein normally produce plants that are also
resistant to viral strains or isolates not closely related at the nucleotide
level.

It has also been shown that transgene silencing is influenced
by temperature, being inactive at temperatures below 15°C (Szittya et
al., 2003). Therefore plants exposed in field conditions to temperatures
below 15°C can lose the silencing-mediated resistance.

It must be borne in mind that, although transgenic plants
resistant to RNA genome viruses have been obtained for several years
through mechanisms based on transgene silencing, it has not so far been
reported that such a strategy can be successfully applied to
geminiviruses (DNA genome viruses).

It is clear that the best strategy for obtaining plants resistant
to a wide spectrum of geminiviruses is the one in which the interfering
product is the protein, and that the breadth of the resistance spectrum
increases the agronomic and commercial value of the resulting plant.

Accordingly, the expression in transgenic plants of dysfunctional
variants of the geminivirus replication-associated protein (Rep) has been
used to obtain plants with greater levels of resistance or immunity against
geminiviruses.

It is known from the literature that the expression of a truncated
Rep protein (Rep-210) of TYLCSV is able to confer resistance
against viral infection, although such resistance is not lasting because the
virus is able to overcome it over time.

Tables 1 and 2 show the results of the resistance analysis of
TYLCSV-agroinoculated, Rep-210-expressing transgenic plants of
Tomato 47 x wt (Brunetti et al., 1997) and of N. benthamiana line
102.22 (Noris et al., 1996), respectively.



Table 1

Lycopersicon   Time      No. infected plants/   % infected plants/
esculentum     (weeks)   inoculated plants      inoculated plants

Rep-210        4         0/13                   0
               9         2/13                   15
               18        5/13                   38
Wild-type      4         6/6                    100


Table 2

Nicotiana      Time      No. infected plants/   % infected plants/
benthamiana    (weeks)   inoculated plants      inoculated plants

Rep-210        2         4/21                   19
               3         11/21                  52
               4         18/21                  86
Wild-type      2         6/6                    100


From the results reported in tables 1 and 2, it can be clearly
inferred that the resistance against TYLCSV mediated by the transgenic
expression of a pathogen-derived sequence is overcome with time.
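The percentages in the last column of tables 1 and 2 follow directly from the infected/inoculated counts. The short Python sketch below recomputes them for the Rep-210 lines; the counts are copied from the tables, while the function and variable names are illustrative only and are not part of the original disclosure.

```python
from fractions import Fraction

# Infected/inoculated counts for the Rep-210 lines, copied from Tables 1 and 2.
# Keys are weeks after agroinoculation.
TABLE_1_TOMATO = {4: (0, 13), 9: (2, 13), 18: (5, 13)}
TABLE_2_BENTHAMIANA = {2: (4, 21), 3: (11, 21), 4: (18, 21)}


def percent_infected(infected, inoculated):
    """Return the share of infected plants as a percentage rounded to an integer."""
    return round(100 * Fraction(infected, inoculated))


def time_course(table):
    """Map each time point to its rounded percentage of infected plants."""
    return {weeks: percent_infected(*counts) for weeks, counts in sorted(table.items())}


if __name__ == "__main__":
    print("Tomato 47 x wt:", time_course(TABLE_1_TOMATO))
    print("N. benthamiana 102.22:", time_course(TABLE_2_BENTHAMIANA))
```

In both hosts the recomputed percentage rises at every successive time point, which is the loss of resistance over time that the tables are meant to document.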

Similarly, the resistance induced by the transgenic
expression of a dominant negative mutant of Rep of the bipartite
geminivirus "African Cassava Mosaic Virus" is also overcome with time
(Sangare et al., 1999).

Another example is the transgenic expression of
the TYLCV capsid protein in a tomato interspecific hybrid (Lycopersicon
esculentum x L. pennellii), which confers a partial resistance against viral
infection (Kunik et al., 1994). In this case too, the resistance mediated by
the expression of the capsid protein is not long lasting and proves to be
of little use from an agronomic point of view.

In the light of the above, there is a clear need for new methods
that allow polynucleotide sequences derived from geminiviruses to be used
successfully in order to obtain plants with long-lasting resistance
against geminiviruses.

The authors of the present invention have now prepared
polynucleotide sequences that encode pathogen-derived viral proteins
able to confer virus resistance on the host and that are suitably modified
so as to be ineffective targets of the virus-induced post-transcriptional
gene silencing, in order to obtain transgenic plants with lasting levels of
resistance against geminiviruses.

In fact, during the experiments the authors showed that the
overcoming of the resistance, and therefore the difficulty in achieving
lasting resistance against geminiviruses, is due to the unexpected ability
of geminiviruses to silence the transgene post-transcriptionally and to
spread in a plant in which the transgene, with sequences homologous to
the infecting virus, is post-transcriptionally silenced.

As shown in figures 2 and 3, respectively, both in the transgenic
plants of N. benthamiana line 102.22 and in the plants of Tomato 47 x wt,
the ability of the virus to overcome the resistance results from silencing
of the transgene by the virus itself and from the unexpected ability of the
virus to spread in a silenced plant.

The ability of TYLCSV to spread in a plant in which the Rep-210
transgene is post-transcriptionally silenced is further substantiated in
figure 4.

The results show that the transgenic tomato plants 47 x 10D (Brunetti et
al., 1997), post-transcriptionally silenced before agroinoculation, as shown
by the absence of the Rep-210 protein and by the concurrent presence of
the transgene-homologous siRNAs, are as susceptible to TYLCSV
infection as the controls.

From the above it follows that, contrary to RNA viruses, the
geminivirus is not blocked by an active silencing of viral gene sequences.
This finding is not limited to the kind of transgenic plant used or to
the way the virus is inoculated, whether through agroinfection or Bemisia
tabaci. In fact, as shown in table 3, using a reduced number of viruliferous
Bemisia per plant, so as to infect between 90% and 100% of the control
plants, about 40% of the transgenic plants (line 201) whose transgene is
post-transcriptionally silenced are not infected or are infected late, while
at a higher inoculum concentration all the plants challenged with
viruliferous insects are infected, similarly to the experiments carried out
using agroinoculation.
Table 3

                    Molecular analysis    Low concentration        High concentration
                    before inoculum       of inoculum (a)          of inoculum (b)
                    Rep-210    siRNAs     2 (c)   3       6        2 (c)   3       6

Transgenic plants   No         Yes        6/15    7/15    8/15     16/21   20/21   21/21
Not transgenic      No         No         11/12   11/12   11/12    8/8     8/8     8/8

(a) Seven viruliferous insects per plant for 2 days
(b) Thirty-five viruliferous insects per plant for 5 days
(c) Weeks after inoculum



It is therefore important to consider that the viral agroinoculation
conditions used for testing the resistance and assessing its persistence
over time (as shown in figures 2, 3 and 4 and in tables 1 and 2) correspond
to high or very high viral pressure conditions. This experimental approach
makes it possible to identify transgenic plants with very high resistance
levels, or immune to the viral infection, and therefore of very high
commercial value.

Accordingly, the introduction of resistance traits against
geminiviruses through the expression of pathogen-derived sequences is
limited by the unexpected ability of geminiviruses to silence the
transgene post-transcriptionally and to spread in the silenced plant.

Furthermore, the authors showed that the transcripts of both the
positive strand (V1 and V2) and the negative strand (C1, C2, C3 and C4) of
TYLCSV are subjected, during a normal infection of wild-type plants, to
viral post-transcriptional silencing, as shown in figure 5. This makes it
impossible to achieve long-term resistance through the expression of
sequences derived from the same pathogen, unless these are suitably
modified so as not to be a target, or to be an ineffective target, of the
virus-induced post-transcriptional gene silencing. By contrast, by
introducing into the plant genome a sequence suitably mutated or chosen
according to the invention, it is possible to obtain a long-lasting
resistance against geminiviruses, unlike that achieved with the known
methods.

Therefore an object of the present invention is a
polynucleotide sequence encoding an amino acid sequence derived from
geminiviruses, said polynucleotide sequence being characterised in that it
is not a target, or is an ineffective target, of the viral
post-transcriptional silencing and having:

a) a nucleotide homology lower than or equal to 90% with respect to
the corresponding gene sequence of the geminiviruses against which
resistance is required, preferably lower than or equal to 80%, more
preferably lower than or equal to 70%;

b) a continuous homology in the RNA transcript, with respect to
the corresponding gene sequence of the geminiviruses against which
resistance is required, lower than or equal to 17 nucleotides, preferably
lower than or equal to 8 nucleotides, more preferably lower than or equal
to 5 nucleotides;

c) a maximum length of the sequence containing a single
substitution with respect to the corresponding gene sequence of the
geminiviruses no longer than 30 nucleotides, preferably no longer than 20
nucleotides, more preferably equal to or lower than 9 nucleotides;

said polynucleotide sequence being able to confer on the whole
plants, tissues or plant cells transformed therewith a lasting resistance
against the geminiviruses.
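Criteria a), b) and c) above can be checked mechanically once a candidate transgene is aligned with the corresponding viral gene. The following Python sketch is a minimal illustration under stated assumptions: the two sequences are of equal length and aligned position by position without gaps, and criterion c) is read here as "the longest stretch that contains at most one substitution". Both the reading and all function names are assumptions made for illustration, not definitions taken from the present description.

```python
def longest_window(viral, candidate, max_mismatches):
    """Length of the longest aligned stretch with at most max_mismatches substitutions."""
    left = 0
    mismatches = 0
    best = 0
    for right in range(len(viral)):
        if viral[right] != candidate[right]:
            mismatches += 1
        # Shrink the window from the left until it is within the mismatch budget.
        while mismatches > max_mismatches:
            if viral[left] != candidate[left]:
                mismatches -= 1
            left += 1
        best = max(best, right - left + 1)
    return best


def homology_report(viral, candidate):
    """Compute the three quantities named in criteria a), b) and c)."""
    if not viral or len(viral) != len(candidate):
        raise ValueError("sequences must be non-empty and of equal length")
    identical = sum(x == y for x, y in zip(viral, candidate))
    return {
        "percent_homology": 100.0 * identical / len(viral),            # criterion a)
        "longest_identical_run": longest_window(viral, candidate, 0),   # criterion b)
        "longest_single_substitution_run": longest_window(viral, candidate, 1),  # criterion c)
    }


def meets_criteria(viral, candidate, max_homology=90.0, max_run=17, max_single=30):
    """True when the candidate satisfies the broadest thresholds of a), b) and c)."""
    report = homology_report(viral, candidate)
    return (
        report["percent_homology"] <= max_homology
        and report["longest_identical_run"] <= max_run
        and report["longest_single_substitution_run"] <= max_single
    )
```

The preferred thresholds (80% or 70%; 8 or 5 nucleotides; 20 or 9 nucleotides) can be tested by passing them as `max_homology`, `max_run` and `max_single`.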

The polynucleotide sequences according to the invention can
be wild-type, synthetic or produced by mutagenesis, and the
geminivirus-derived amino acid sequences encoded by them are wild-type
or mutant sequences that interfere with the viral infection.

Therefore the invention includes polynucleotide sequences of
geminivirus, either suitably changed or wild-type, that differ at the
nucleotide level from the corresponding genomic sequence of
the geminivirus against which resistance is to be introduced,
according to the principles defined above and specified in a), b) and c).

A further object of the present invention is a polynucleotide
sequence encoding a geminivirus-derived amino acid sequence, said
polynucleotide sequence being characterised in that it is not a target, or
is an ineffective target, of the post-transcriptional silencing, having a
homology even equal to 100% with respect to the sequence of the
geminivirus against which resistance is required, and being shortened
so as to be under-represented in the siRNA population with respect to the
original sequence, while maintaining similar interfering abilities.

The gene sequences from which the polynucleotide sequence
according to the invention is constructed can derive from
geminiviruses of the genera Mastrevirus, Curtovirus, Begomovirus and
Topocuvirus, particularly from the species shown in table 4 and their
isolates, and more particularly from the species of Tomato yellow leaf
curl and their isolates shown in table 5.



Table 4

List of geminivirus species                   Acronym

African cassava mosaic virus                  ACMV
Bean calico mosaic virus                      BcaMV
Bean dwarf mosaic virus                       BDMV
Bean golden mosaic virus                      BGMV
Bean golden yellow mosaic virus               BGYMV
Cabbage leaf curl virus                       CaLCuV
Chilli leaf curl virus                        ChiLCuV
Cotton leaf crumple virus                     CLCrV
Cotton leaf curl Alabad virus                 CLCuAV
Cotton leaf curl Gezira virus                 CLCuGV
Cotton leaf curl Kokhran virus                CLCuKV
Cotton leaf curl Multan virus                 CLCuMV
Cotton leaf curl Rajasthan virus              CLCuRV
Cowpea golden mosaic virus                    CPGMV
Cucurbit leaf curl virus                      CuLCuV
East African cassava mosaic Cameroon virus    EACMCV
East African cassava mosaic Malawi virus      EACMMV
East African cassava mosaic virus             EACMV
East African cassava mosaic Zanzibar virus    EACMZV
Indian cassava mosaic virus                   ICMV
Ipomea yellow vein virus                      IYVV
Melon chlorotic leaf curl virus               MCLCuV
Mungbean yellow mosaic India virus            MYMIV
Mungbean yellow mosaic virus                  MYMV
Okra yellow vein mosaic virus                 OYVMV
Papaya leaf curl virus                        PaLCuV
Pepper golden mosaic virus                    PepGMV
Pepper huasteco yellow vein virus             PHYVV
Pepper leaf curl Bangladesh virus             PepLCBV
Pepper leaf curl virus                        PepLCV
Potato yellow mosaic Panama virus             PYMPV
Potato yellow mosaic Trinidad virus           PYMTV
Potato yellow mosaic virus                    PYMV
South African cassava mosaic virus            SACMV
Soybean crinkle leaf virus                    SbCLV
Squash leaf curl China virus                  SLCCNV
Squash leaf curl virus                        SLCV
Squash leaf curl Yunnan virus                 SLCYV
Squash mild leaf curl virus                   SMLCV
Squash yellow mild mottle virus               SYMMoV
Sri Lankan cassava mosaic virus               SLCMV
Sweet potato leaf curl Georgia virus          SPLCGV
Sweet potato leaf curl virus                  SPLCV
Tobacco curly shoot virus                     TbCSV
Tobacco leaf curl Japan virus                 TbLCJV
Tobacco leaf curl Kochi virus                 TbLCKoV
Tobacco leaf curl Yunnan virus                TbLCYNV
Tobacco leaf curl Zimbabwe virus              TbLCZV
Tomato chlorotic mottle virus                 ToCMoV
Tomato golden mosaic virus                    TGMV
Tomato golden mottle virus                    ToGMoV
Tomato leaf curl Bangalore virus              ToLCBV
Tomato leaf curl Bangladesh virus             ToLCBDV
Tomato leaf curl Gujarat virus                ToLCGV
Tomato leaf curl Karnataka virus              ToLCKV
Tomato leaf curl Laos virus                   ToLCLV
Tomato leaf curl Malaysia virus               ToLCMV
Tomato leaf curl New Delhi virus              ToLCNDV
Tomato leaf curl Sri Lanka virus              ToLCSLV
Tomato leaf curl Taiwan virus                 ToLCTWV
Tomato leaf curl Vietnam virus                ToLCVV
Tomato leaf curl virus                        ToLCV
Tomato mosaic Havana virus                    ToMHV
Tomato mottle Taino virus                     ToMoTV
Tomato mottle virus                           ToMoV
Tomato rugose mosaic virus                    ToRMV
Tomato severe leaf curl virus                 ToSLCV
Tomato severe rugose virus                    ToSRV
Tomato yellow leaf curl China virus           TYLCCNV
Tomato yellow leaf curl Gezira virus          TYLCGV
Tomato yellow leaf curl Malaga virus          TYLCMalV
Tomato yellow leaf curl Sardinia virus        TYLCSV
Tomato yellow leaf curl Thailand virus        TYLCTHV
Tomato yellow leaf curl virus                 TYLCV
Watermelon chlorotic stunt virus              WmCSV
Wheat dwarf virus                             WDV
Maize streak virus                            MSV
Sugarcane streak virus                        SSV
Bean yellow dwarf virus                       BYDV
Tobacco yellow dwarf virus                    TYDV
Tomato pseudo curly top virus                 TPCTV
Beet curly top virus                          BCTV



Table 5

Species of tomato yellow leaf curl (Fauquet et al., 2003)         Acronym

Tomato yellow leaf curl China virus                               TYLCCNV
Tomato yellow leaf curl China virus AF311734                      TYLCCNV
Tomato yellow leaf curl China virus - [Y64] AJ457823              TYLCCNV-[Y64]
Tomato yellow leaf curl China virus - Tb [Y10] AJ319675           TYLCCNV-Tb[Y10]
Tomato yellow leaf curl China virus - Tb [Y11] AJ319676           TYLCCNV-Tb[Y11]
Tomato yellow leaf curl China virus - Tb [Y25] AJ457985           TYLCCNV-Tb[Y25]
Tomato yellow leaf curl China virus - Tb [Y36] AJ420316           TYLCCNV-Tb[Y36]
Tomato yellow leaf curl China virus - Tb [Y38] AJ420317           TYLCCNV-Tb[Y38]
Tomato yellow leaf curl China virus - Tb [Y5] AJ319674            TYLCCNV-Tb[Y5]
Tomato yellow leaf curl China virus - Tb [Y8] AJ319677            TYLCCNV-Tb[Y8]
Tomato yellow leaf curl Gezira virus                              TYLCGV
Tomato yellow leaf curl Gezira virus - [1] AY044137               TYLCGV-[1]
Tomato yellow leaf curl Gezira virus - [2] AY044138               TYLCGV-[2]
Tomato yellow leaf curl Gezira virus - [Shambat] AY044139         TYLCGV-[Sha]
Tomato yellow leaf curl Malaga virus                              TYLCMalV
Tomato yellow leaf curl Malaga virus AF271234                     TYLCMalV
Tomato yellow leaf curl Sardinia virus                            TYLCSV
Tomato yellow leaf curl Sardinia virus X61153                     TYLCSV
Tomato yellow leaf curl Sardinia virus - Spain [1] Z25751         TYLCSV-ES[1]
Tomato yellow leaf curl Sardinia virus - Spain [2] L27708         TYLCSV-ES[2]
Tomato yellow leaf curl Sardinia virus - Sicily Z28390            TYLCSV-Sic
Tomato yellow leaf curl Thailand virus                            TYLCTHV
Tomato yellow leaf curl Thailand virus - [1] X63015, X63016       TYLCTHV-[1]
Tomato yellow leaf curl Thailand virus - [2] AF141922, AF141897   TYLCTHV-[2]
Tomato yellow leaf curl Thailand virus - [Myanmar] AF206674       TYLCTHV-[MM]
Tomato yellow leaf curl Thailand virus - [Y72] AJ495812           TYLCTHV-[Y72]
Tomato yellow leaf curl virus                                     TYLCV
Tomato yellow leaf curl virus X15656                              TYLCV
Tomato yellow leaf curl virus - [Almeria] AJ489258                TYLCV-[Alm]
Tomato yellow leaf curl virus - [Aichi] AB014347                  TYLCV-[Aic]
Tomato yellow leaf curl virus - [Cuba] AJ223505                   TYLCV-[CU]
Tomato yellow leaf curl virus - [Dominican Republic] AF024715     TYLCV-[DO]
Tomato yellow leaf curl virus - [Portugal] AF105975               TYLCV-[PT]
Tomato yellow leaf curl virus - [Saudi Arabia]                    TYLCV-[SA]
Tomato yellow leaf curl virus - [Shizuokua] AB014346              TYLCV-[Shi]
Tomato yellow leaf curl virus - [Spain7297] AF071228              TYLCV-[ES7297]
Tomato yellow leaf curl virus - Iran AJ132711                     TYLCV-IR
Tomato yellow leaf curl virus - Mild X76319                       TYLCV-Mld



Preferably the species of Begomovirus are TYLCCNV, TYLCGV,
TYLCMalV, TYLCSV, TYLCTHV, TYLCV, ACMV, BGMV, CaLCuV,
ToCMoV, TGMV, ToGMoV, ToMHV, ToMoTV, ToMoV, ToRMV, ToSLCV,
ToSRV, Cotton leaf curl (CLCrV, CLCuAV, CLCuGV, CLCuKV, CLCuMV,
CLCuRV), East African cassava mosaic (EACMCV, EACMMV, EACMV,
EACMZV), Potato yellow mosaic (PYMPV, PYMTV, PYMV), Squash leaf
curl (SLCCNV, SLCV, SLCYV), Sweet potato leaf curl (SPLCGV, SPLCV),
Tobacco leaf curl (TbLCJV, TbLCKoV, TbLCYNV, TbLCZV), Tomato leaf
curl (ToLCBV, ToLCBDV, ToLCGV, ToLCKV, ToLCLV, ToLCMV,
ToLCNDV, ToLCSLV, ToLCTWV, ToLCVV, ToLCV) and isolates thereof.

Other preferred geminivirus species, belonging to the other
genera Mastrevirus, Curtovirus and Topocuvirus, are WDV, MSV, SSV,
BYDV, TYDV, BCTV and their isolates.

The gene sequence belonging to the genome of the
geminiviruses can be the sequence C1/AL1/AC1, C2/AL2/AC2,
C3/AL3/AC3, C4/AL4/AC4, V1/AR1/AV1, V2/AR2/AV2, BC1/BL1 or
BV1/BR1, particularly the sequence C1/AL1/AC1 of the previously
described geminiviruses and their isolates.

The amino acid sequence encoded by the polynucleotide
sequence object of the present invention is a pathogen-derived protein
able to confer resistance against geminiviruses on the plants
expressing it. Since said interfering protein is, according to the
invention, stably expressed, it confers a lasting resistance irrespective
of the molecular mechanism by which the protein product induces
resistance.

The pathogen-derived protein can be a capsid protein, a
replication-associated viral protein (Rep), or a protein encoded by the
genes C2/AL2/AC2, C3/AL3/AC3, C4/AL4/AC4, V2/AR2/AV2, BC1/BL1 and
BV1/BR1.

An example of a possible polynucleotide sequence satisfying
the above requirements is set forth in figures 16A and 16B, which
show the alignment between the wild-type nucleotide sequence encoding
the Rep-210 protein of TYLCSV and the synthetic nucleotide
sequence modified so as not to be a target of the post-transcriptional
degradation induced by the infecting virus, where both nucleotide
sequences encode the same viral protein.

The plants, tissues or plant cells that can be transformed with
these polynucleotide sequences can be tomato, pepper, tobacco, sweet
potato, cotton, melon, squash, manioc, potato, bean, soybean, mung
bean, beet, sugar cane, corn or wheat.

A further object of the present invention is a construct
comprising a heterologous polynucleotide sequence containing, in the
5'-3' direction:

a) a polynucleotide sequence acting as promoter in said plant,
tissue or transformed cells;

b) a non-translated polynucleotide sequence positioned 5' of
the coding region, belonging or not to the intergenic region of
geminivirus;

c) a polynucleotide sequence according to the invention or a
fragment or a variant thereof;

d) a sequence acting as transcription terminator, positioned
3' with respect to said polynucleotide sequence.

A further object of the present invention is an expression vector
comprising the previously described construct.

Further objects of the present invention are a transgenic plant,
tissue or plant cell, the progeny thereof and seeds comprising in their
genome a polynucleotide sequence according to the present invention.

Finally, an object of the present invention is a method for the
preparation of transgenic plants, tissues or plant cells thereof with
long-lasting resistance to geminiviruses, comprising the following steps:

a) "identification" or "selection" of a viral gene sequence
encoding an amino acid sequence able to confer resistance against
geminiviruses;

b) mutagenesis or "choice" of the viral gene sequence so as to
make it an ineffective target of the post-transcriptional silencing induced
by the infecting geminivirus;

c) insertion of the mutated or chosen geminivirus gene
sequence obtained in step b), through a construct as previously described,
into the plant, tissue or plant cell thereof.

With reference to step a) of the method according to the
present invention, the term "identification" means the experimental
recognition of said viral gene sequence able to confer resistance against
geminiviruses, while the term "selection" means the recognition of an
already available viral gene sequence able to confer a non-lasting
resistance against geminiviruses. Accordingly, the method according to
the present invention also provides the solution to the problem of
the loss of resistance against geminiviruses that occurs when
known sequences are employed.

Particularly, the mutagenesis provided for in step b) is carried out
maintaining a nucleotide homology, with respect to the corresponding
gene sequence of the geminiviruses against which resistance is required,
lower than or equal to 90%, preferably lower than or equal to 80%, more
preferably lower than or equal to 70%, distributed so that the continuous
homology in the transcribed RNA with respect to the corresponding
sequence of geminiviruses is lower than or equal to 17 nucleotides,
preferably lower than or equal to 8 nucleotides, more preferably lower
than or equal to 5 nucleotides, and the maximum length of the sequence
containing a single substitution with respect to the native gene sequence
is not more than 30 nucleotides, preferably not more than 20 nucleotides,
more preferably lower than or equal to 9 nucleotides.

As for the amino acid sequence encoded by the polynucleotide
sequence identified or selected in step a), according to the present
invention it can be a protein having 100% homology with the
viral wild-type protein.

This mutagenesis includes all mutations of the nucleotide
sequence that do not decrease the ability of the protein to confer
resistance against geminivirus. Possible mutations are both silent point
mutations and those leading to substitution with amino acids having
similar biochemical characteristics, as well as deletions and/or
insertions and/or substitutions.

Alternatively, the mutagenesis in step b) of the method
according to the present invention consists of deletions at the ends of
the polynucleotide sequence so that said sequence, while maintaining
similar interfering abilities, is under-represented, with respect to the
original sequence, in the natural population of the siRNAs produced
by the infecting virus.

Alternatively the "choice" in step b) of the method according to
the present invention consists in the recognition of geminivirus wild-type
sequences that differ at the nucleotide level from the geminivirus against
which resistance is required, so as not to be a target or to be an
ineffective target of the post-transcriptional silencing.

Particularly, the mutagenesis in step b) of the method
according to the present invention can consist of deletions of the 3' or 5'
region of the viral gene sequence of step a), until the minimum region of
said gene sequence is identified that is under-represented, with respect
to the sequence encoding a wild-type protein, in the population of the
siRNAs, while the truncated protein maintains the ability to confer
resistance against geminiviruses.

Moreover, the viral gene sequence of step a) of the method
according to the present invention can be that of the TYLCSV C1/AL1/AC1
gene and the amino acid sequence can be a protein truncated relative to
the viral wild-type protein such as, for instance, Rep-130.

Among various agronomic applications of the synthetic 
polynucleotide sequences according to the present invention, of particular 
interest is their use for obtaining tomato plants resistant to TYLCSV. 

In this particular embodiment, the transgenic polynucleotide
sequence encoding the truncated viral Rep protein (Rep-210) has been
modified through a 3' deletion, resulting in an ineffective target of
TYLCSV-induced post-transcriptional gene silencing, while maintaining the
ability to confer resistance.

In particular, using stringent hybridizations with radioactive RNA
probes, a transcribed region of the TYLCSV genome was identified that
is under-represented in the population of viral-origin siRNAs produced
during TYLCSV infection of wild-type plants, as shown in figures
6 and 7. This region corresponds to the first 390 nucleotides of the gene
encoding the TYLCSV Rep. Its transcript is an ineffective target of
virus-induced post-transcriptional gene silencing; it encodes the Rep-130
protein, which is stably expressed, resulting in a lasting resistance.

Therefore, in a particular embodiment of the invention, the
amino acid sequence of geminiviruses, such as TYLCSV, encoded by
the polynucleotide sequence according to the invention can be the
truncated Rep-130 protein (SEQ ID No 9). In this case the viral gene
sequence made an ineffective target or a non-target of the
post-transcriptional silencing is SEQ ID No 8.

A further object of the present invention is a method as
described above wherein the mutagenesis in step b) consists of silent
point mutations of the viral gene sequence of step a) that maintain the
ability of the encoded amino acid sequence to confer resistance against
geminiviruses and make the sequence an ineffective target or a non-target
of the post-transcriptional silencing.

Particularly, the viral gene sequence of step a) can be the
V1/AR1/AV1 (CP) gene, for instance of TYLCSV (SEQ ID No 12), and in a
particular embodiment the viral gene sequence made an ineffective target
or a non-target of the post-transcriptional silencing is SEQ ID No 6.

In addition, the viral gene sequence of step a) can be the
TYLCSV C1/AL1/AC1 gene, and in this case the viral gene sequence
made an ineffective target or a non-target of the post-transcriptional
silencing can be SEQ ID No 2 or SEQ ID No 4.

The present invention will now be described by way of
illustration and not of limitation, according to preferred embodiments
thereof, with particular reference to the figures of the enclosed
drawings, wherein:

figure 1 shows the genome of the tomato yellow leaf curl Sardinia
virus species (TYLCSV). The DNA is transcribed bidirectionally and
contains six partially or totally overlapping open reading frames (ORFs),
two on the viral strand (V), V1 and V2, and four on the complementary
strand (C), C1, C2, C3 and C4;

figure 2 shows the expression of Rep-210 protein in
TYLCSV-agroinoculated transgenic N. benthamiana plants (line 102.22). The
symbols (-), (+) and NI mean healthy, infected and non-inoculated plants,
respectively. Analysis has been carried out before the agroinoculation with
TYLCSV (0 wpi) and four and eight weeks after it (4 and 8 wpi,
respectively);

figure 3 shows the expression of the Rep-210 protein and of the 
transgenic mRNAs in tomato plants (line 47 X wt) before the 
agroinoculation with TYLCSV (0 wpi) and 22 weeks after it (22 wpi). The 
symbols (+) and (-) mean plants that are, respectively, infected or healthy 
at the specified time; 

figure 4 shows the analysis of the expression of Rep-210 protein
and of the siRNAs corresponding to the relative transcript in transgenic
tomato plants (47 X 10D line) before TYLCSV agroinoculation. The
symbols (+) and (-) on the panel mean the presence and absence of the
sense and antisense C1 transgene, respectively; wt means control
wild-type plant; the symbols (+), (-) and NI under the panel of the siRNAs
mean the presence (+) and absence (-) of the virus and, as a control,
non-agroinoculated plants (NI);

figure 5 shows the Northern blot of the small RNAs extracted 
from TYLCSV-infected wild-type tomato plants (samples 1-4) and non- 
infected as a control (sample C); M means a molecular weight marker; 

figure 6 shows the distribution analysis of the small interfering
RNAs with respect to the genome of the TYLCSV. On the top, the linear
map of the TYLCSV genome; transcripts are indicated by the arrows (V1
and V2 with the same polarity as the viral genome and C1, C2, C3 and C4
with complementary polarity); open boxes from 1 to 9 positioned under the
viral genome map represent the nine PCR fragments (each about three
hundred nucleotides in length) into which the genome has been divided and
for which the ethidium bromide staining and the hybridization with the
siRNAs extracted from TYLCSV-infected tomato plants are shown. IR
represents a tenth PCR fragment, corresponding to the non-transcribed
intergenic region of the TYLCSV genome. The numbers under the panels
indicate the percentage of hybridization signal for each PCR fragment;

figure 7 shows the analysis of the presence of siRNAs in non- 
agroinoculated (sample C) or TYLCSV-agroinoculated (samples 1 and 2) 
wild-type tomato plants at four weeks after inoculation; particularly the 
siRNAs corresponding to the transcript for Rep-210 (probe A) and for Rep- 
130 (probe B) were analyzed; the columns 100 and 50 on the left panels 
correspond to 100 and 50 pg, respectively, of an oligonucleotide 
homologous to both the A and B probes; 

figure 8 shows the nucleotide sequence (SEQ ID No 8)
encoding Rep-130 (SEQ ID No 9) of the pTOM130 plasmid. Nucleotides not
belonging to the TYLCSV but deriving from cloning are in capital letters;
the underlined sequences correspond to the BamHI and EcoRI restriction
sites used for cloning. The start and stop codons are set forth in
boldface, while the mutations introduced for eliminating the C4 protein
expression are in italic boldface characters;

figure 9 shows the Southern blot of total nucleic acids extracts 
from wild-type N. benthamiana protoplasts cotransfected with a TYLCSV 
infectious clone (pTOM6), along with the plasmid expressing the mutated 
Rep protein indicated above each column; 

figure 10 shows the quantitative analysis of the TYLCSV 
replication in wild-type N. benthamiana protoplasts cotransfected with 
pTOM6 plasmid along with the plasmid expressing the mutated Rep 
protein; 

figure 11 shows the scheme of the pTOM130 plasmid used for
obtaining Rep-130-expressing transgenic plants. LB and RB mean left
and right border, respectively; pE35S represents the duplicated Cauliflower
Mosaic Virus 35S promoter; Rep-130 (SEQ ID No 8) is the sequence
encoding the Rep-130 protein (SEQ ID No 9); t35S is the Cauliflower
Mosaic Virus 35S terminator; tNOS is the terminator of the gene encoding
the nopaline synthase; nptII is the sequence encoding the neomycin
phosphotransferase; pNOS is the promoter of the gene for the nopaline
synthase; Kan is the gene for the kanamycin resistance;

figure 12 shows the analysis of the expression of Rep-130
protein (SEQ ID No 9) in transgenic N. benthamiana plants transformed
with pTOM130 (lines 300-309);

figure 13 shows the analysis of the TYLCSV replication in
wild-type (wt) and transgenic N. benthamiana protoplasts expressing either
the Rep-130 protein (SEQ ID No 9) (lines 300, 301, 303) or the Rep-210
protein (102.22);

figure 14 shows the analysis of the expression of Rep-130
protein (SEQ ID No 9) in transgenic L. esculentum plants transformed with
pTOM130 (lines 402, 403, 406, 411, 413, 416, 417). A protein extract from
a transgenic Rep-130-expressing N. benthamiana plant (line 303) was used
as positive control;

figure 15 shows the comparison between an L. esculentum
transgenic plant transformed with pTOM130 (line 406) expressing Rep-130
and a non-transformed wild-type plant;

figure 16 A and B shows two examples of synthetic sequences
encoding Rep-210 (SEQ ID No 2, SEQ ID No 4). The alignment between
the wild-type nucleotide sequence encoding TYLCSV Rep-210 protein
(Seq_cod_Rep210_wild_type, on the top; SEQ ID No 1) and the synthetic
nucleotide sequence modified so as to be an ineffective target of the
virus-induced post-transcriptional silencing
(Seq_cod_Rep210_silencing_minus, in the bottom; SEQ ID No 2, SEQ ID
No 4) is shown. In the synthetic sequences, the mutated nucleotides with
respect to the wild-type sequence are shaded;

figure 17 shows the analysis of transient expression, by
agroinfiltration into N. benthamiana leaves, of Rep-210 protein encoded by
the wild-type gene of plasmid pTOM102(C4 -) and by the synthetic gene
Rep-210 silencing minus B (SEQ ID No 4) (plasmid pTOM102Syn);

figure 18 shows a transient assay for the inhibition of viral
replication through co-agroinfiltration of an A. tumefaciens strain
containing the TYLCSV infectious clone along with A. tumefaciens strains
containing: a) the pTOM102(C4 -) plasmid expressing the wild-type gene
for Rep-210, SEQ ID No 1 (Brunetti et al., 2001), lines 1-3; b) the
pTOM102Syn plasmid expressing the synthetic gene for Rep-210 (Rep-210
silencing minus B; SEQ ID No 4), lines 4-6; c) the empty cloning
plasmid pBIN19, lines 7-9;

figure 19 shows the analysis of the expression of Rep-210
protein in transgenic N. benthamiana plants transformed with the
pTOM102Syn plasmid containing the synthetic gene for Rep-210, Rep-210
silencing minus B, SEQ ID No 4 (lines 506, 508A and 508B). A
protein extract from a Rep-210-expressing transgenic tomato plant was
used as positive control;

figure 20 shows the analysis of the expression of Rep-210
protein in transgenic N. benthamiana plants transformed with pTOM102
(line 102.22, Noris et al., 1996) or with pTOM102Syn (line 506) after
TYLCSV agroinoculation. Analysis has been performed before (0 wpi) and
five weeks after (5 wpi) TYLCSV agroinoculation;

figure 21 shows the analysis of the infection by "dot-blot" assay
at 2, 3, 4 and 5 wpi (where wpi means the number of weeks after the
agroinoculation) on N. benthamiana wild-type (WT) or transgenic plants
transformed with either pTOM102 (line 102.22, Noris et al., 1996) or
pTOM102Syn (line 506);

figure 22 shows an example of a synthetic sequence encoding
CP. The alignment between the wild-type nucleotide sequence encoding
TYLCSV CP (TYLCSV CP, on the top; SEQ ID No 12) and the synthetic
nucleotide sequence modified so as to be an extremely ineffective target
or a non-target of the virus-induced post-transcriptional degradation
(TYLCSV CP silencing minus, in the bottom; SEQ ID No 6) is shown. In the
synthetic sequence, the mutated nucleotides with respect to the wild-type
sequence are shaded.

EXAMPLE 1 : Identification of regions of the TYLCSV genome 
under-represented in the siRNAs population. 

In a natural infection of wild-type plants by TYLCSV, the viral
sequences transcribed from both strands of the TYLCSV genome are targets
of post-transcriptional gene silencing, as indicated by the presence of
siRNAs homologous to different portions of the genome (figure 5). Figure
5 shows the Northern blot of total RNAs extracted from wild-type tomato
plants infected by TYLCSV (samples 1-4) and from a non-infected control
(sample C). The probe and the restriction sites used are indicated beside
each panel. The estimated sizes of the siRNAs are also set forth.

In order to evaluate whether some regions of the TYLCSV genome
constitute a less effective target of post-transcriptional gene silencing
than others, a systematic study of the siRNA distribution with respect to
their position on the viral genome was performed. To this end the TYLCSV
genome has been divided into nine contiguous fragments, each of about
three hundred base pairs (as drawn in figure 6), obtained by PCR with
specific oligonucleotides. The same amount of each fragment has been
transferred onto a nylon filter after agarose gel electrophoresis.
Quantification of the PCR fragments loaded on the agarose gel has been
performed with the Aida software. The siRNAs produced by a
TYLCSV-infected tomato plant have been purified from the total RNAs,
end-labelled and used as a probe (Szittya et al., 2002) on the filter
containing the several regions of the TYLCSV genome. The intensity of the
hybridization signals, referred to the same amount of loaded fragment,
has been assessed with the TYPHOON apparatus (Amersham-Pharmacia). In
this way a different distribution of the siRNAs with respect to the
several regions of the viral genome has been detected (fig. 6).
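
The normalisation described above (hybridization signal referred to the same amount of loaded fragment, then expressed as a percentage) can be sketched as follows. This is an illustrative Python sketch only: the function name and all numerical values are hypothetical and do not reproduce the measurements of figure 6.

```python
def relative_sirna_signal(hybridization: dict, loaded: dict) -> dict:
    """Share (%) of the load-normalised hybridization signal per fragment."""
    if hybridization.keys() != loaded.keys():
        raise ValueError("both measurements must cover the same fragments")
    # Refer each hybridization signal to the amount of fragment on the filter.
    normalised = {name: hybridization[name] / loaded[name]
                  for name in hybridization}
    total = sum(normalised.values())
    return {name: 100.0 * value / total for name, value in normalised.items()}


# Hypothetical readings for three fragments (arbitrary units).
hybridization = {"1": 300.0, "2": 100.0, "3": 100.0}
loaded = {"1": 1.0, "2": 1.0, "3": 0.5}
shares = relative_sirna_signal(hybridization, loaded)
for name, share in shares.items():
    print(name, round(share, 1))
```

A fragment with a low share in such an analysis would correspond to a region under-represented in the siRNA population.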

EXAMPLE 2 : Identification of a region of TYLCSV C1 gene
under-represented in the siRNAs population.

In order to identify a region of the TYLCSV C1 gene
under-represented in the siRNAs population, total RNAs have been
extracted (Brunetti et al., 1997) from both healthy and TYLCSV-infected
tomato plants.

Thirty micrograms of such RNAs have been subjected to 8%
denaturing polyacrylamide gel electrophoresis and transferred by
capillarity onto a nylon filter through Northern blot. Two identical
replicas have been produced and each has been hybridized with
probes corresponding to different portions of the 5' region of the C1 gene,
as shown in figure 7. One filter has been hybridized with a probe derived
from the 5' portion of the C1 gene comprising 42 nt of non-translated
leader sequence and the first 630 nucleotides of the C1 gene (about 3/5 of
the C1 gene) (probe A) and the other filter with a probe derived from the
5' portion of the C1 gene comprising 42 nt of non-translated leader
sequence and the first 390 nucleotides of the C1 gene (probe B).

In order to quantitatively compare the results obtained with the
two different probes (deriving from two independent labellings), scalar
amounts of the same 40mer oligonucleotide complementary to both probes
have been loaded on both replicas. Columns 100 and 50 correspond to
100 and 50 picograms of such oligonucleotide, respectively. The panels
showing the oligonucleotide migration have been set close to the
respective panels containing the siRNAs, but their position in the figure
does not correspond to the position on the gel, because the
oligonucleotide and the siRNAs have different molecular weights.

Both probes after in vitro transcription have been submitted to 
alkaline hydrolysis (Cox et al., 1984) in order to obtain from them 
fragments with an average length of 75 nucleotides. 

The hybridizations have been performed for 16 hours at 39°C in
the buffer described by Dalmay et al., 2000. After hybridization the filters
have been washed in 2X SSC, 0.2% SDS twice for 10 minutes at 40°C,
twice for 10 minutes at 45°C and once for 10 minutes at 50°C.

It is remarkable how under-represented the proximal 5' region of
the C1 gene is in the siRNAs population. Particularly, the quantitative
analysis of the results performed with the TYPHOON apparatus
(Amersham-Pharmacia) revealed that the siRNAs corresponding to this 5'
region (probe B) are about 25% of those corresponding to the region
extended up to the nucleotides encoding the first 210 amino acids
(probe A). Said 5' region therefore constitutes an ineffective target for
the virus-induced post-transcriptional gene silencing.
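
Because the two probes derive from independent labellings, each siRNA signal has to be referred to the signal given by the common oligonucleotide standard on the same filter before the two filters are compared. A minimal Python sketch of that calculation follows; the function name and the readings are hypothetical and were chosen only so that the result matches the order of magnitude reported above.

```python
def probe_ratio_percent(sirna_b: float, standard_b: float,
                        sirna_a: float, standard_a: float) -> float:
    """siRNA signal of probe B as a percentage of probe A.

    Each signal is first divided by the signal of the same amount of the
    shared oligonucleotide standard measured on the same filter.
    """
    calibrated_a = sirna_a / standard_a
    calibrated_b = sirna_b / standard_b
    return 100.0 * calibrated_b / calibrated_a


# Hypothetical readings (arbitrary units) for the 100 pg standard lane.
ratio = probe_ratio_percent(sirna_b=150.0, standard_b=300.0,
                            sirna_a=800.0, standard_a=400.0)
print(ratio)
```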

These results have been confirmed using the method described 
in example 1, i.e., where PCR fragments corresponding to the two 
different regions of the C1 gene were hybridised with the population of 
siRNA extracted from tomato plants infected by TYLCSV. 

EXAMPLE 3 : Construction of a polynucleotide sequence of the 
C1 gene 5' portion encoding the truncated Rep. 

As previously pointed out (Brunetti et al., 1997), the Rep-210
transgenic plants show a resistance that is not long-lasting and an
altered phenotype.

As can be seen in figure 1, the C4 gene is nested in the
truncated C1 gene in a different reading frame.

It has been shown that the transgenic expression of the
geminivirus C4 gene induces phenotype alterations (Krake et al., 1998).

Therefore, several truncated C1 constructs unable to express the
C4 ORF have been designed.

In order to obtain C4 (-) mutants, a stop codon has been
introduced in the C4 sequence through the introduction of two point
mutations. Particularly, referring to the pTOM130 sequence set forth in
figure 8 (SEQ ID No 8), the mutation at nucleotide 233 consists of a
transversion from C to G that converts the TCA codon (encoding serine) of
the reading frame encoding C4 into TGA (opal). In addition, the mutation
at nucleotide 231 consists of a transition from C to T that restores a
leucine codon in the reading frame encoding C1 (CTC becomes TTG, more
represented in plants).

Thereby the translation of the C4 protein is interrupted after
only 10 amino acids, while the amino acid sequence of the C1 protein
remains unchanged. The two introduced mutations have been chosen
among many possible mutations based on the criterion of generating a
"strong" stop codon in the C4 reading frame while maintaining in the C1
reading frame a leucine codon compatible with codon usage in plants.
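
The effect of the two substitutions on the two overlapping reading frames can be verified with the standard genetic code. The sketch below, in Python, assumes that the C1 codon occupies nucleotides 231-233 and the C4 codon nucleotides 232-234, as inferred from the description above; the four-nucleotide window (CTCA before, TTGA after mutagenesis) is a reconstruction for illustration, not an excerpt of SEQ ID No 8.

```python
from itertools import product

# Standard genetic code built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Assumed window covering nucleotides 231-234 (see lead-in).
wild_type = "CTCA"
mutated = "TTGA"   # C->T at 231 and C->G at 233

# C1 frame: first three nucleotides of the window.
c1_before, c1_after = CODON_TABLE[wild_type[:3]], CODON_TABLE[mutated[:3]]
# C4 frame: shifted by one nucleotide.
c4_before, c4_after = CODON_TABLE[wild_type[1:]], CODON_TABLE[mutated[1:]]

print("C1 frame:", c1_before, "->", c1_after)   # leucine stays leucine
print("C4 frame:", c4_before, "->", c4_after)   # serine becomes a stop (*)
```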

Mutagenesis has been performed by PCR with the following
mutated oligonucleotides:

C4 plus primer (SEQ ID No 10): 5'-CT CAT CTC CAT ATT TTG
ATC CAA TTC GAA G-3'

C4 minus primer (SEQ ID No 11): 5'-C TTC GAA TTG GAT
CAA AAT ATG GAG ATG AG-3' (2419-2448 in TYLCV - Kheyr-Pour et
al., 1991)
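
The two mutagenic primers anneal to the same 30-nucleotide site on opposite strands, so each should be the reverse complement of the other. A short Python check, using the primer sequences exactly as given above (only the helper function name is an assumption), is:

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


# Primer sequences as listed above, with the triplet spacing removed.
c4_plus = "CT CAT CTC CAT ATT TTG ATC CAA TTC GAA G".replace(" ", "")
c4_minus = "C TTC GAA TTG GAT CAA AAT ATG GAG ATG AG".replace(" ", "")

print(len(c4_plus), len(c4_minus))
print(reverse_complement(c4_plus) == c4_minus)
# The plus primer carries the mutated TTG codon followed by A (TTGA),
# i.e. the overlapping TGA stop codon of the C4 frame.
print("TTGA" in c4_plus)
```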

Each of the two mutated primers has been used along with an
external primer in two separate PCR reactions using pGEM102 as
template (Brunetti et al., 2001).

Particularly, the external oligonucleotides are Rev and Univ
(M13/pUC sequencing primers n. 1233 and 1224). From the reaction
performed with Univ/C4plus a 537 bp fragment has been obtained, while
from the reaction with Rev/C4minus a 351 bp fragment has been obtained.

The obtained products have been used as templates for a
subsequent amplification reaction carried out using the two external
primers.

The obtained PCR product has been digested with EcoRI and
BamHI restriction enzymes and cloned into the corresponding sites of
pJIT60, thus obtaining pJITR210. In both cases sequencing has been
carried out to verify the clones.

EXAMPLE 4 : Identification of the minimal 5' region of TYLCSV
C1 gene that when expressed in plant cells is able to inhibit viral
replication.

In order to define the minimal 5' terminal region of the C1 gene
able to confer resistance against TYLCSV, a series of 3'-terminal deletion
mutants of the C1 gene was cloned in the pJIT60 expression vector,
resulting in a pJTR series.

The viral sequences have been amplified by PCR with Pfu DNA
polymerase (Stratagene), using specific primers containing restriction
sites at the ends.

The previously described pJITR210 plasmid, which encodes 
Rep-210, and contains a stop codon for the internal C4 protein, has been 
used as template. The fragments obtained by amplification reactions have 
been digested with BamHI and EcoRI enzymes and cloned in the 
corresponding sites of pJIT60 resulting in the pJTR series. 

All final clones have been sequenced in order to confirm the 
amplification fidelity and vector-insert junctions. The length and the precise 
positions of every amplified sequence are set forth in table 6. 

The ability of each Rep deletion mutant to confer resistance
against TYLCSV has been evaluated through cotransfection assays of N.
benthamiana wild-type protoplasts with a TYLCSV infectious clone
(pTOM6) along with each mutant, followed by analysis of the
replication level of the viral genome through Southern blot. The obtained
results are set forth in figures 9 and 10.



Table 6

pJTR210a   42 bp UTR + truncated C1 ORF (630 nt)   1985-2656 (1)
           containing C4 encoding region
pJTR210    42 bp UTR + truncated C1 ORF (630 nt)   1985-2656 (1)
pJTR181    42 bp UTR + truncated C1 ORF (543 nt)   2072-2656 (1)
pJTR156    42 bp UTR + truncated C1 ORF (468 nt)   2147-2656 (1)
pJTR130    42 bp UTR + truncated C1 ORF (390 nt)   2225-2656 (1)
pJTR120    42 bp UTR + truncated C1 ORF (360 nt)   2255-2656 (1)
pJTR80     42 bp UTR + truncated C1 ORF (240 nt)   2375-2656 (1)
pJTR54     42 bp UTR + truncated C1 ORF (162 nt)   2453-2656 (1)

(1) nucleotide numbering of the TYLCSV genome is according to
Kheyr-Pour et al. 1991.
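
The entries of table 6 are mutually consistent: each coordinate range spans the 42 bp UTR plus the truncated ORF, and the ORF length divided by three gives the number of amino acids that appears in the plasmid name. The following Python sketch recomputes these values from the table; the dictionary layout and function name are illustrative choices.

```python
UTR_LENGTH = 42  # bp of non-translated leader present in every construct

# name: (truncated C1 ORF length in nt, first position, last position)
CONSTRUCTS = {
    "pJTR210": (630, 1985, 2656),
    "pJTR181": (543, 2072, 2656),
    "pJTR156": (468, 2147, 2656),
    "pJTR130": (390, 2225, 2656),
    "pJTR120": (360, 2255, 2656),
    "pJTR80": (240, 2375, 2656),
    "pJTR54": (162, 2453, 2656),
}


def summarise(constructs: dict) -> dict:
    """Return {name: (insert length in bp, encoded amino acids)}."""
    summary = {}
    for name, (orf_nt, start, end) in constructs.items():
        insert_length = end - start + 1          # inclusive coordinates
        if insert_length != orf_nt + UTR_LENGTH:
            raise ValueError(f"{name}: coordinates and ORF length disagree")
        summary[name] = (insert_length, orf_nt // 3)
    return summary


summary = summarise(CONSTRUCTS)
for name, (insert_length, residues) in summary.items():
    print(f"{name}: {insert_length} bp insert, {residues} amino acids")
```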



The protoplast cotransfection, total nucleic acid extraction and
Southern analysis have been performed according to already described
methods (Brunetti et al. 2001).

Total nucleic acid extracts from each protoplast sample have
been analysed through Southern blot with a digoxigenin-labelled RNA
probe corresponding to the sequence encoding Rep-210, and the pGEM-P
plasmid was used as control. In particular, figure 9 represents a
Southern blot of total nucleic acids, where scDNA and ssDNA mean
supercoiled and single-stranded DNA of TYLCSV, respectively.

For an accurate quantitative analysis of the effect of the
expression of several truncated forms of Rep on the replication of the
TYLCSV genome, a Southern analysis has been performed with a
32P-labelled DNA probe corresponding to the region encoding the first 54
N-terminal amino acids of Rep, and the radioactivity level corresponding
to each band detected on the filter has been evaluated through analysis
with the Instant Imager apparatus (Canberra, Packard).

Each mutated construct has been assayed in duplicate, in three
independent experiments, and the value set forth in figure 10 represents
the average of two or three cotransfections of the three independent
experiments.

The level of TYLCSV replication in the cotransfection
experiments performed with pTOM6 along with the pGEM-P control plasmid
was considered equal to 100%.

Particularly, figure 10 shows a quantitative analysis: the white
and black bars of the histograms represent the amounts of supercoiled
and single-stranded DNA, respectively; error bars indicate the mean
standard deviation.
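
The quantification described above amounts to expressing each band as a percentage of the control cotransfection and then averaging over the replicates. A minimal Python sketch is given below. The counts are hypothetical, and the use of the sample standard deviation is an assumption, since the text does not specify how the deviation shown in figure 10 was computed.

```python
from statistics import mean, stdev


def relative_replication(sample_counts, control_counts):
    """Mean and standard deviation of replication as % of the control.

    control_counts: band radioactivity for pTOM6 + control plasmid (100%).
    sample_counts:  band radioactivity for pTOM6 + a Rep deletion mutant.
    """
    control_mean = mean(control_counts)
    percentages = [100.0 * count / control_mean for count in sample_counts]
    return mean(percentages), stdev(percentages)


# Hypothetical counts from three cotransfections (arbitrary units).
control = [1000.0, 1200.0, 800.0]
mutant = [50.0, 100.0, 150.0]
average, deviation = relative_replication(mutant, control)
print(round(average, 1), round(deviation, 1))
```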

As can be seen from figures 9 and 10, the first 130 N-terminal
amino acids of the Rep protein are enough to inhibit viral replication
almost completely, while the expression of the first 120 N-terminal
amino acids has no influence.

EXAMPLE 5 : Production of N. benthamiana transgenic plants
expressing Rep-130.

The analysis of the ability of the Rep mutants to inhibit TYLCSV
replication, assessed through transient expression in protoplasts, has
revealed that the shortest mutant still effective encodes Rep-130 (SEQ ID
No 9), as described in the preceding example.

It has also been shown in previous examples that the
proximal 5' portion of the C1 gene encoding Rep-130 is a less effective
target of post-transcriptional gene silencing than the sequence encoding
Rep-210.

Therefore N. benthamiana transgenic plants expressing Rep-130
have been obtained. For this purpose, the pTOM130 plasmid
represented in figure 11 has been obtained by cloning the KpnI-BglII
fragment of pJTR130 into the KpnI-BamHI sites of pBIN19.

N. benthamiana has been transformed with the A. tumefaciens
pGV2260 C58 strain containing the pTOM130 plasmid, and plants resistant
to kanamycin have been regenerated as described (Noris et al., 1996).

The primary transformants have been analysed for the
presence of the transgene by PCR analysis and for the expression of
Rep-130 protein through Western blot, as shown in figure 12.

The protein extracts obtained from transgenic (300-309) or wild- 
type control (wt) plants have been analysed by Western blot using an anti- 
TYLCSV Rep rabbit polyclonal primary antibody as described (Noris et al., 
1996). 

EXAMPLE 6 : Plant cells stably transformed with the pTOM130
construct and expressing Rep-130 inhibit TYLCSV replication.

In order to obtain an early evaluation of the resistance conferred
by Rep-130, protoplasts isolated from several primary transgenic
Rep-130-expressing N. benthamiana plants were transfected with a TYLCSV
infectious clone (pTOM6).

Transgenic lines have been chosen for their high Rep-130 
expression, as revealed by Western blot analysis (figure 12). 

The level of TYLCSV replication in such transgenic protoplasts
has been compared with that observed in N. benthamiana wild-type
protoplasts and in transgenic protoplasts expressing Rep-210 (line
102.22).

Particularly, figure 13 shows the analysis of TYLCSV replication
in wild-type (wt) and transgenic N. benthamiana protoplasts expressing
either Rep-130 (SEQ ID No 9) (lines 300, 301, 303) or Rep-210 (line
102.22). Total nucleic acid extracts from several protoplast samples have
been analysed by Southern blot using a digoxigenin-labelled RNA probe
corresponding to the Rep-210 transcript.

In order to compare the level of TYLCSV replication in the
Rep-130 transgenic protoplasts with that observed in wild-type
protoplasts, the total nucleic acids extracted from wild-type protoplasts
have also been loaded following 1:10 and 1:50 dilutions, as shown in
figure 13.

EXAMPLE 7 : Production of tomato transgenic plants expressing
Rep-130

The analysis of the ability to inhibit TYLCSV replication by Rep 
mutants, assessed by transient expression in protoplasts, has revealed 
that the shortest mutant still effective encodes Rep-130, as described in 
the previous example. 

As previously shown, the proximal 5' portion of the C1 gene
encoding Rep-130 is an ineffective target of post-transcriptional gene
silencing compared with the sequence encoding Rep-210.

Therefore transgenic tomato plants (Lycopersicon esculentum
cv. Moneymaker) expressing Rep-130 have been produced.

The tomato has been transformed using A. tumefaciens 
pGV2260 C58 strain containing pTOM130 plasmid (figure 11) and 
kanamycin-resistant plants were regenerated as described (Brunetti et al. 
1997). 

The primary transformants have been analysed for the presence
of the transgene by PCR analysis and for the expression of Rep-130
protein by Western blot (figure 14).

The protein extracts obtained from transgenic (lines 400) or 
wild-type control (wt) plants have been analysed by Western blot using an 
anti-TYLCSV Rep polyclonal rabbit primary antibody as described (Noris 
et al. 1996). 

All transgenic tomato plants expressing Rep-130 protein are
phenotypically indistinguishable from wild-type plants (figure 15).

EXAMPLE 8 : Demonstration of long-lasting resistance against
TYLCSV in plants transgenic for the pTOM130 construct expressing
Rep-130

In order to assess the durability of the resistance against TYLCSV
conferred by Rep-130 expression, the N. benthamiana R1 transgenic
plants expressing Rep-130 have been agroinoculated with the A.
tumefaciens LBA4404 strain containing the TYLCSV infectious clone.

As previously reported, the viral delivery through 
agroinoculation, used to assay the resistance and evaluate stability over 
time, corresponds to high or very high viral pressure conditions. 

Infection of plants has been assessed at weekly intervals by a 
"tissue printing" assay, using a digoxigenin-labelled probe specific for the 
coat protein gene. 

The results in table 7 show that, unlike the results described in 
table 2 concerning transgenic plants expressing Rep-210, transgenic N. 
benthamiana plants expressing Rep-130 protein show a long-lasting 
resistance when agroinoculated with TYLCSV. This can be deduced by 
comparison of the resistant plants at 2 and 6 weeks following inoculation. 



Table 7

Plants      Time      N° infected plants /   % infected plants /
            (weeks)   inoculated plants      inoculated plants

Rep-130     2         0/10                   0
            4         0/10                   0
            6         0/10                   0
Wild-type   2         9/10                   90
            4         10/10                  100
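
The percentage column of table 7 follows directly from the infected/inoculated counts. As a small worked check in Python (the function name and the dictionary layout are illustrative; the scores are those of table 7):

```python
def percent_infected(score: str) -> float:
    """Convert an 'infected/inoculated' score such as '9/10' to a percentage."""
    infected, inoculated = (int(value) for value in score.split("/"))
    return 100.0 * infected / inoculated


# (plants, weeks after inoculation): infected/inoculated, from table 7.
TABLE_7 = {
    ("Rep-130", 2): "0/10",
    ("Rep-130", 4): "0/10",
    ("Rep-130", 6): "0/10",
    ("Wild-type", 2): "9/10",
    ("Wild-type", 4): "10/10",
}

for (plants, weeks), score in TABLE_7.items():
    print(plants, weeks, score, percent_infected(score))
```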



In addition, the stability of the resistance against TYLCSV
conferred by Rep-130 expression was assessed in transgenic R2
tomato plants.



The plants have been agroinoculated with the A. tumefaciens
C58C1+pCH32 strain containing the TYLCSV infectious clone. The
agroinoculation conditions used for assaying the resistance and evaluating
its stability over time correspond to high or very high viral pressure
conditions.

The infection has been assessed at intervals of one or two
weeks through dot-blot assay, using a radioactively labelled probe specific
for the coat protein gene.

The results in table 8 show that tomato transgenic plants
expressing Rep-130 show a long-lasting resistance when agroinoculated
with a TYLCSV infectious clone. This can be deduced from the
comparison of the resistant plants at 3 and 12 weeks after inoculation.



Table 8

Plants        Time      N° infected plants /   % infected plants /
              (weeks)   inoculated plants      inoculated plants

Rep-130       3         0/9                    0
              4         0/9                    0
              5         0/9                    0
              6         0/9                    0
              7         0/9                    0
              8         0/9                    0
              10        0/9                    0
              12        0/9                    0
Segregating   3         3/3                    100
Rep-130
Wild-type     3         4/4                    100



Therefore the resistance is associated with the presence of the
Rep-130 protein (SEQ ID No 9) and with the ability of the
TYLCSV-inoculated transgenic plant to stably express Rep-130, because the
sequence encoding it is an ineffective target of virus-induced
post-transcriptional gene silencing and Rep, even if further mutated,
maintains its ability to confer virus resistance.

EXAMPLE 9 : Construction of a synthetic polynucleotide
sequence, modified in order not to be or to be an extremely ineffective
target of the post-transcriptional gene silencing induced by the infecting
virus, encoding TYLCSV Rep-210 protein.

In order to achieve long-lasting expression of the TYLCSV
Rep-210 protein in transgenic plants, a synthetic polynucleotide
sequence encoding the Rep-210 protein was produced, employing the
method according to the present invention, that is not a target, or is
an ineffective target, of the post-transcriptional gene silencing
induced by the infecting virus.

In addition, the following criteria were followed:

- the synthetic polynucleotide sequence is not able to encode
the C4 protein, having in positions 231 and 233 the same mutations
shown in figure 8, as described for Rep-130 (SEQ ID No 8; SEQ ID No 9);

- the introduced mutations are all silent, namely the protein
product encoded by the synthetic polynucleotide sequence matches that
encoded by the viral wild-type sequence;

- the mutations were introduced according to the frequency of
codon usage in tomato genes; in particular, whenever possible the
codon most frequently used in tomato was selected;

- the introduced mutations were all checked to exclude the
possible formation of sequences having a particular function, such as
polyadenylation or splicing signals, including cryptic ones.
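
Two of the criteria above (silent mutations and absence of unwanted signals) lend themselves to a simple computational check. The following Python sketch is illustrative only: the function names are assumptions, the two 48-nucleotide fragments are the first 16 codons of SEQ ID No 1 and SEQ ID No 2 as given in the sequence listing (with scan errors in the listing read as the letter c), and only the canonical AATAAA polyadenylation motif is searched. The tomato codon-usage criterion is not modelled, since no usage table is given in the text.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, ignoring a trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def is_silent(wild_type: str, synthetic: str) -> bool:
    """True if both sequences encode exactly the same protein."""
    return translate(wild_type) == translate(synthetic)


def find_motifs(dna: str, motifs=("AATAAA",)) -> dict:
    """Return the 0-based start positions of each motif in the sequence."""
    dna = dna.upper()
    return {
        motif: [i for i in range(len(dna) - len(motif) + 1)
                if dna.startswith(motif, i)]
        for motif in motifs
    }


# First 16 codons of the wild-type (SEQ ID No 1) and synthetic (SEQ ID No 2) genes.
WT_5PRIME = "atgccaagatcaggtcgttttagtatcaaggctaaaaattatttcctt"
SYN_5PRIME = "atgcctagatccggaaggtttagcatcaaagctaagaattacttcttg"

print(translate(WT_5PRIME))
print(is_silent(WT_5PRIME, SYN_5PRIME))
print(find_motifs(SYN_5PRIME))
```

A real design check would extend the motif list (for example with splice-site consensus sequences) and scan both strands; those choices are left open here.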

Following the above-described criteria, two synthetic
sequences encoding Rep-210 were designed (figure 16 A and B,
SEQ ID No 2, SEQ ID No 3, SEQ ID No 4, SEQ ID No 5).
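
How far a designed sequence diverges from the viral gene can be quantified with the three measures recited in claim 1 of this document (overall homology of at most 90%, continuous homology of at most 17 nucleotides, and at most 30 nucleotides for a stretch containing a single substitution). The Python sketch below is a hedged illustration, not the authors' procedure: the function names are assumptions, the two sequences are assumed to be already aligned and of equal length, and the third measure is interpreted here as the longest stretch containing exactly one mismatch, which is only one possible reading of the claim wording. The fragments are the first 48 nucleotides of SEQ ID No 1 and SEQ ID No 2.

```python
def mismatch_positions(a: str, b: str) -> list:
    """0-based positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned and of equal length")
    return [i for i, (x, y) in enumerate(zip(a.lower(), b.lower())) if x != y]


def percent_identity(a: str, b: str) -> float:
    """Percentage of positions that are identical."""
    return 100.0 * (len(a) - len(mismatch_positions(a, b))) / len(a)


def longest_identical_run(a: str, b: str) -> int:
    """Length of the longest uninterrupted stretch of identical nucleotides."""
    bounds = [-1] + mismatch_positions(a, b) + [len(a)]
    return max(right - left - 1 for left, right in zip(bounds, bounds[1:]))


def longest_single_substitution_stretch(a: str, b: str) -> int:
    """Longest stretch containing exactly one mismatch (assumed reading of criterion c)."""
    mismatches = mismatch_positions(a, b)
    if not mismatches:
        return 0
    bounds = [-1] + mismatches + [len(a)]
    return max(bounds[k + 2] - bounds[k] - 1 for k in range(len(mismatches)))


def meets_thresholds(a: str, b: str, max_identity=90.0, max_run=17, max_stretch=30) -> bool:
    """Check the three thresholds taken from claim 1."""
    return (percent_identity(a, b) <= max_identity
            and longest_identical_run(a, b) <= max_run
            and longest_single_substitution_stretch(a, b) <= max_stretch)


WT_FRAGMENT = "atgccaagatcaggtcgttttagtatcaaggctaaaaattatttcctt"
SYN_FRAGMENT = "atgcctagatccggaaggtttagcatcaaagctaagaattacttcttg"

print(f"identity: {percent_identity(WT_FRAGMENT, SYN_FRAGMENT):.1f}%")
print("longest identical run:", longest_identical_run(WT_FRAGMENT, SYN_FRAGMENT))
print("meets thresholds:", meets_thresholds(WT_FRAGMENT, SYN_FRAGMENT))
```

The figures printed for this short fragment say nothing about the full-length gene; the same functions would have to be run on the complete aligned sequences.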

A non-translated leader sequence at the 5' end and a stop
codon at the 3' end were added to the sequence of the synthetic
Rep-210 silencing minus B gene (SEQ ID No 4).

In particular, the polynucleotide sequence containing, in the
5'-3' order, the non-translated leader sequence, the synthetic sequence
encoding Rep-210 (figure 16 B; SEQ ID No 4, SEQ ID No 5) and the stop
codon was assembled by PCR starting from oligonucleotides
(Prodromou and Laurence, 1992; Stemmer et al., 1995), using a
thermostable DNA polymerase with proofreading activity (Pfu
DNA Polymerase, Stratagene and/or Pfx DNA Polymerase, Invitrogen).

The synthetic gene was subsequently cloned into the pJIT60
plasmid under the transcriptional control of the 35S promoter of
Cauliflower mosaic virus (CaMV) and the CaMV 35S transcription
termination sequences, producing pJT60Syn. The cassette containing,
in the 5'-3' order, the 35S promoter, the Rep-210 synthetic gene and
the 35S terminator was then excised from the pJT60Syn plasmid by
restriction with KpnI-BglII and cloned into the KpnI-BamHI sites of
the binary plasmid pBIN19, generating pTOM102Syn.

EXAMPLE 10 : Evaluation of the inhibition of viral replication by
the Rep-210 synthetic gene.

The correct expression of the Rep-210 protein encoded by the
synthetic gene was checked through agroinfiltration of N.
benthamiana leaves with A. tumefaciens C58C1/pCH32 transformed with
pTOM102Syn. The strain C58C1/pCH32 transformed with pTOM102 (C4-)
was used as a positive control, while the strain C58C1/pCH32
transformed with the binary plasmid pBIN19 was used as a negative
control. Western blot analysis (figure 17) shows the expression of
the Rep-210 protein encoded by the synthetic gene.

In order to assess the ability of the Rep-210 protein encoded
by pTOM102Syn to inhibit TYLCSV replication, a transient
co-agroinfiltration assay was carried out. N. benthamiana leaves were
co-agroinfiltrated with the A. tumefaciens C58C1/pCH32 strain
containing the TYLCSV infectious clone (pTOM6) along with the A.
tumefaciens C58C1/pCH32 strain containing: a) the pTOM102Syn plasmid;
b) the pTOM102 (C4-) plasmid; c) the pBIN19 binary plasmid. TYLCSV
replication was assessed through Southern analysis of the total
nucleic acids extracted from the co-agroinfiltrated tissues 72 hours
after infiltration. This analysis showed that the Rep-210 protein
expressed by the synthetic gene (pTOM102Syn) and that expressed by the
pTOM102 (C4-) wild-type gene inhibit TYLCSV replication in a similar
manner (figure 18).

EXAMPLE 11 : Production of transgenic N. benthamiana plants 
expressing the synthetic gene for Rep-210. 

In order to obtain transgenic N. benthamiana plants expressing
the synthetic gene for Rep-210, N. benthamiana leaf discs were
transformed using the A. tumefaciens LBA 4404 strain containing the
pTOM102Syn plasmid, and the kanamycin-resistant plants were
regenerated as described (Noris et al., 1996).

The primary transformants were analysed for the expression of
the Rep-210 protein by Western blot analysis. Four primary
transformants (lines 506, 508, 517 and 537) accumulating intermediate
levels of Rep-210 were selected for further studies. Figure 19 shows
the Western analysis of proteins extracted from the 506, 508A and 508B
plants.

EXAMPLE 12 : Stability of the resistance in N. benthamiana
plants transgenic for the Rep-210 synthetic gene.

The authors have previously shown (Noris et al., 1996; Brunetti
et al., 1997) that there is a direct correlation between the amount of
Rep-210 protein produced by the transgenic plants and resistance
against TYLCSV. Transgenic plants transformed with the pTOM102
construct and accumulating intermediate levels of Rep-210 protein are
as susceptible to viral infection as non-transformed plants (Noris et
al., 1996 and unpublished data). The low level of Rep-210 protein in
these plants is not enough to completely inhibit viral replication,
thus allowing the establishment of an early virus-induced
post-transcriptional silencing, which leads to a drastic reduction in
Rep-210 protein accumulation and hence to loss of resistance.

In order to assess whether the synthetic gene encoding the
Rep-210 protein is not a target, or is an ineffective target, of
virus-induced post-transcriptional gene silencing and is therefore
able to control the viral infection over time, transgenic plants of
line 102.22 (R3) and of line 506 (R0), expressing similar amounts of
Rep-210 (figure 20, 0 wpi), were agroinoculated with TYLCSV and
analysed by dot blot at weekly intervals for the accumulation of
TYLCSV. As expected (Noris et al., 1996), the transgenic R3 plants of
line 102.22 (figure 21, 5-8), which accumulate intermediate levels of
Rep-210 protein, are as susceptible as the non-transformed plants
(figure 21, 1-4). As shown by dot blot analysis (figure 21, 9-12), the
plants transgenic for the synthetic construct (R0 line 506) are
resistant to viral infection, accumulating only limited amounts of
virus. Interestingly, and consistent with the inability of the virus to
post-transcriptionally silence the synthetic gene effectively, Rep-210
was still accumulating 5 weeks after inoculation (figure 20, 5 wpi) and
inhibiting TYLCSV replication over time (figure 21, 9-12).

The results described in the examples show that it is
possible to obtain a long-lasting resistance against geminiviruses by
expressing in plants a transgene consisting of a pathogen-derived
polynucleotide sequence, provided the latter is suitably selected or
modified in order not to be a target, or to be an ineffective target,
of the post-transcriptional gene silencing induced by the infecting
virus.

EXAMPLE 13 : Construction of a synthetic polynucleotide
sequence modified in order not to be a target or to be an extremely
ineffective target of the post-transcriptional degradation induced by
the infecting virus, encoding the TYLCSV capsid protein.

As reported above, the transgenic expression of the TYLCV
capsid protein in an interspecific tomato hybrid (Lycopersicon
esculentum X L. pennellii) confers a partial resistance against viral
infection (Kunik et al., 1994). In this case too, the resistance
mediated by the expression of the capsid protein is not long-lasting.

In order to obtain a stable expression of the TYLCSV capsid
protein (CP) by transgenic plants, a synthetic polynucleotide sequence
encoding the CP was produced, which is an ineffective target of
virus-induced post-transcriptional gene silencing.

The synthetic polynucleotide sequence was designed, employing
the method according to the present invention, so as to satisfy the
requirement of not being a target, or of being an extremely
ineffective target, of virus-induced post-transcriptional gene
silencing.

In addition, the following criteria were followed:

- the introduced mutations are all silent, namely the protein
product encoded by the synthetic polynucleotide sequence exactly
matches that encoded by the viral wild-type sequence;

- the introduced mutations take into account the frequency of
codon usage in tomato genes; in particular, whenever possible the
codon most frequently used in tomato is selected;

- the introduced mutations were all checked to exclude the
possible formation of sequences having a particular function, such as
polyadenylation signals or splicing signals, including cryptic ones.

Following the above-described criteria, a synthetic sequence
encoding CP was designed (SEQ ID No 12) (figure 22, TYLCSV CP
silencing minus, SEQ ID No 6, SEQ ID No 7).

CLAIMS

1. Polynucleotide sequence encoding a geminivirus-derived
amino acid sequence, said polynucleotide sequence being characterised
in that it is not a target, or is an ineffective target, of the viral
post-transcriptional silencing and has:

a) a homology at the nucleotide level below or equal to 90% with
respect to the corresponding gene sequence of the geminiviruses against
which the resistance is required;

b) a continuous homology in the transcribed RNA, with respect
to the corresponding gene sequence of the geminiviruses, below or equal
to 17 nucleotides;

c) a maximum length of the sequence containing a single
substitution with respect to the corresponding gene sequence of the
geminiviruses not higher than 30 nucleotides;

said polynucleotide sequence being able to confer on the plants,
tissues or plant cells transformed therewith a lasting resistance
against the geminiviruses.

2. Sequence according to claim 1, wherein the homology at the
nucleotide level with respect to the corresponding gene sequence of the
geminivirus is below or equal to 80%.

3. Sequence according to claim 1, wherein the homology at the
nucleotide level with respect to the corresponding gene sequence of the
geminivirus is below or equal to 70%.

4. Sequence according to any of the preceding claims,
wherein the continuous homology in the transcribed RNA with respect to
the gene sequence of the geminiviruses is below or equal to 8 nucleotides.

5. Sequence according to any of the preceding claims,
wherein the continuous homology in the transcribed RNA with respect to
the gene sequence of the geminiviruses is below or equal to 5 nucleotides.

6. Sequence according to any of the preceding claims,
wherein the maximum length of the sequence containing a single
substitution with respect to the corresponding gene sequence of the
geminiviruses is not more than 20 nucleotides.

7. Sequence according to any of the preceding claims,
wherein the maximum length of the sequence containing a single
substitution with respect to the corresponding gene sequence of the
geminiviruses is not more than 9 nucleotides.

8. Sequence according to any of the preceding claims,
wherein the polynucleotide sequence has been mutated or is a wild-type
sequence selected from a geminivirus so as to differ at the nucleotide
level with respect to the corresponding genomic sequence of the
geminivirus against which a resistance is required according to any of
the preceding claims.

9. Sequence according to any of the preceding claims,
wherein the geminiviruses are selected from the group consisting of
species of Mastrevirus, Curtovirus, Begomovirus and Topocuvirus and
isolates thereof.

10. Sequence according to claim 9, wherein the Begomovirus
species are selected from the group consisting of TYLCCNV, TYLCGV,
TYLCMalV, TYLCSV, TYLCTHV, TYLCV, ACMV, BGMV, CaLCuV,
ToCMoV, TGMV, ToGMoV, ToMHV, ToMoTV, ToMoV, ToRMV, ToSLCV,
ToSRV, Cotton leaf curl (CLCrV, CLCuAV, CLCuGV, CLCuKV, CLCuMV,
CLCuRV), East African cassava mosaic (EACMCV, EACMMV, EACMV,
EACMZV), Potato yellow mosaic (PYMPV, PYMTV, PYMV), Squash leaf
curl (SLCCNV, SLCV, SLCYV), Sweet potato leaf curl (SPLCGV, SPLCV),
Tobacco leaf curl (TbLCJV, TbLCKoV, TbLCYNV, TbLCZV), Tomato leaf
curl (ToLCBV, ToLCBDV, ToLCGV, ToLCKV, ToLCLV, ToLCMV,
ToLCNDV, ToLCSLV, ToLCTWV, ToLCVV, ToLCV) and isolates thereof.

11. Sequence according to claim 9, wherein the species
belonging to the genera Mastrevirus, Curtovirus and Topocuvirus are
selected from the group consisting of WDV, MSV, SSV, BYDV, TYDV, BCTV
and isolates thereof.

12. Sequence according to claim 1, wherein the gene sequence
is selected from the group consisting of C1/AL1/AC1, C2/AL2/AC2,
C3/AL3/AC3, C4/AL4/AC4, V1/AR1/AV1, V2/AR2/AV2, BC1/BL1 and
BV1/BR1, belonging to the geminiviruses.

13. Sequence according to claim 12, wherein the C1/AL1/AC1 gene
sequence is from the geminiviruses as defined according to claims 10
and 11.

14. Sequence according to claim 1, wherein the geminivirus
amino acid sequence is a pathogen-derived protein able to confer
resistance against the geminiviruses on the plants expressing the
protein.

15. Sequence according to claim 14, wherein the protein is
selected from the group consisting of capsid protein, replication
associated viral protein (Rep), and proteins encoded by the genes
C2/AL2/AC2, C3/AL3/AC3, C4/AL4/AC4, V2/AR2/AV2, BC1/BL1 and BV1/BR1.

16. Sequence according to claim 1, wherein the plants, tissues
or cells thereof belong to the group consisting of tomato, pepper,
tobacco, squash, manioc, sweet potato, cotton, melon, potato, soybean,
vine, corn, wheat, sugar cane, bean, beet.

17. Polynucleotide sequence encoding a geminivirus-derived
amino acid sequence, said polynucleotide sequence being characterised
in that it is not a target, or is an ineffective target, of the viral
post-transcriptional silencing, has a homology even equal to 100% with
respect to the sequence of the geminivirus against which a resistance
is required, and is suitably shortened so as to be under-represented in
the siRNA population with respect to the original sequence.

18. Construct comprising a heterologous polynucleotide
sequence containing in the 5'-3' direction:

a) a polynucleotide sequence acting as promoter in said plant
or tissue or transformed cells;

b) a non-translated polynucleotide sequence positioned 5' of the
encoding region;

c) a polynucleotide sequence as defined according to claims
from 1 to 17, a fragment or a variant thereof;

d) a sequence acting as transcription terminator, positioned 3'
with respect to said polynucleotide sequence.

19. Expression vector comprising the construct as defined
according to claim 18.

20. Transgenic plant, tissue or plant cells thereof, comprising in
their genome a polynucleotide sequence as defined according to claims
from 1 to 17.

21. Progeny of the plants and plant tissues according to
claim 20.

22. Seed comprising in its genome a polynucleotide sequence 
as defined according to claims from 1 to 17. 

23. Method for the preparation of transgenic plants, plant tissue
or cells thereof with long-lasting resistance against geminiviruses,
including the following steps:

a) identification or selection of a viral gene sequence encoding
an amino acid sequence able to confer resistance against geminiviruses;

b) mutagenesis or choice of the viral gene sequence so as to
make it an ineffective target of the post-transcriptional silencing
induced by the infecting geminivirus;

c) insertion of the geminivirus gene sequence mutated or
chosen in step b) in the plant, plant tissue or cell thereof using a
construct as defined according to claim 18.

24. Method according to claim 23, wherein by mutagenesis the
homology at the nucleotide level with respect to the gene sequence of
the geminivirus against which resistance is required is maintained
below or equal to 90%, distributed in such a way that the continuous
homology in the transcribed RNA with respect to the corresponding gene
sequence of the geminiviruses is below or equal to 17 nucleotides and
the maximum length of the sequence containing a single substitution
with respect to the corresponding gene sequence of the geminiviruses is
not higher than 30 nucleotides.

25. Method according to any one of claims 23 and 24,
wherein by mutagenesis a homology at the nucleotide level with respect
to the gene sequence of the geminivirus is maintained below or equal to
80%.

26. Method according to any one of claims 23 and 24,
wherein by mutagenesis a homology at the nucleotide level with respect
to the gene sequence of the geminivirus is maintained below or equal to
70%.

27. Method according to any one of claims 23 to 26,
wherein by mutagenesis a continuous homology at the nucleotide level
with respect to the gene sequence of the geminivirus is maintained
below or equal to 8 nucleotides.

28. Method according to any one of claims 23 to 27,
wherein by mutagenesis a continuous homology at the nucleotide level
with respect to the gene sequence of the geminivirus is maintained
below or equal to 5 nucleotides.

29. Method according to any one of claims 23 to 28,
wherein the maximum length of the sequence containing a single
substitution with respect to the corresponding gene sequence of
geminiviruses is no more than 20 nucleotides.

30. Method according to any one of claims 23 to 29,
wherein the maximum length of the sequence containing a single
substitution with respect to the corresponding gene sequence of
geminiviruses is no more than 9 nucleotides.

31. Method according to any one of claims 23 to 30,
wherein the mutagenesis consists of silent point mutations or deletions
and/or insertions and/or substitutions.

32. Method according to any one of claims 23 to 31,
wherein the mutagenesis in step b) consists of deletions of the 5' or 3'
regions of the viral gene sequence of step a) until the identification
of the minimum region of said gene sequence that is under-represented,
with respect to the sequence encoding the wild-type protein, in the
population of the interfering siRNAs, such that the truncated protein
maintains the ability to confer resistance against geminiviruses.

33. Method according to claim 32, wherein the viral gene
sequence of step a) is the C1/AL1/AC1 gene.

34. Method according to claim 32, wherein the C1/AL1/AC1
gene is a TYLCSV gene.

35. Method according to claim 32, wherein the amino acid
sequence is a truncated protein with respect to the viral wild-type
protein.

36. Method according to any one of claims 32 to 35, in
which the viral gene sequence made not a target or an ineffective
target of the post-transcriptional silencing is SEQ ID No 8.

37. Method according to any one of claims 32 to 36,
wherein the truncated protein is Rep-130 (SEQ ID No 9).

38. Method according to any one of claims 23 to 31,
wherein the mutagenesis in step b) consists of silent point mutations
of the viral gene sequence of step a), so as to maintain the ability of
the amino acid sequence encoded by the same to confer resistance
against geminiviruses and not to be a target, or to be an ineffective
target, of the post-transcriptional silencing.

39. Method according to claim 38, wherein the viral gene
sequence of step a) is the V1/AR1/AV1 (CP) gene.

40. Method according to claim 39, wherein the V1/AR1/AV1
(CP) gene is a TYLCSV gene.

41. Method according to any one of claims 38 to 40,
wherein the viral gene sequence made not a target or an ineffective
target of the post-transcriptional silencing is SEQ ID No 8.

42. Method according to claim 38, wherein the viral gene
sequence of step a) is C1/AL1/AC1 of TYLCSV.

43. Method according to claim 42, wherein the viral gene
sequence made not a target or an ineffective target of the
post-transcriptional silencing is SEQ ID No 2 or SEQ ID No 4.



ABSTRACT OF THE DISCLOSURE 



The invention relates to a method for the production of
transgenic plants durably resistant against geminiviruses,
wherein the transgene consists of a suitably modified
polynucleotide sequence derived from the pathogen.



WO 2004/101798    PCT/IT2004/000287

Fig. 1



Fig. 2 (blot panels at 0, 4 and 8 wpi; lanes numbered 1 to 7, some
marked NI; band labelled Rep-210)

Fig. 3 (panels at 0 and 22 wpi; lanes 1 to 3; rows labelled: Viral
infection, Rep-210, Rep-210 mRNA, nptII mRNA, 25S rRNA)



Fig. 4 (plants 47X10D and wt; rows labelled: C1 Sense, C1 Antisense,
Rep-210, EtBr, 21-22 nt siRNA; NI lane; presence of virus 4 weeks
after inoculum scored + or -)



Fig. 6

Fig. 7



SEQ ID No 8

GGATCCCCctggatactttgagtgtcccccgattcagaac 40
gacagcaaaaatgccaagatcaggtcgttttagtatcaag 80
gctaaaaattatttccttacatatcccaaatgtgatttaa 120
caaaagaaaatgcactttcccaaataacaaacctacaaac 160
acccacaaacaaattattcatcaaaatttgcagagaacta 200
catgaaaatggggaacctcatctccatattttgatccaat 240
tcgaaggaaaatacaattgtaccaatcaacgattcttcga 280
cctggtatccccaaccaggtcagcacatttccatccgaac 320
attcagggagctaaatcgagctccgacgtcaagtcctata 360
tcgacaaggacggagatgttcttgaatggggtactttcca 400
gatcgacggacgatctgctaggggaggacaacagacagcc 440
tGAATTC 447

Fig. 8



Fig. 10 (axis labelled "% of replication", ticks from 25 to 175)

Fig. 11

Fig. 12




Fig. 14 (lanes: 303, WT, 402, 403, 406, 411, 413, 416, 417; band
labelled Rep-130)

Fig. 15 (WT and Rep-130)









Fig. 17 (band labelled Rep-210)

Fig. 18 (lanes 1 to 9 and C; bands labelled dsDNA and ssDNA)



Fig. 19 (band labelled Rep-210)

Fig. 21 (dot blot at 4 wpi; samples 1 to 12 grouped as wt, 102.22
and 506)






SEQUENCE LISTING

<110> ENEA-Ente per le Nuove Tecnologie e l'Ambiente
      Consiglio Nazionale delle Ricerche

<120> Method for the preparation of transgenic plants characterised by
      Geminivirus lasting resistance

<130> PCT25622

<140> RM2003A000242
<141> 2003-05-19

<150> RM2003A000242
<151> 2003-05-19

<160> 12

<170> PatentIn version 3.2

<210> 1
<211> 630
<212> DNA
<213> Geminivirus TYLCSV






<400> 1 

atgccaagat caggtcgttt tagtatcaag gctaaaaatt atttccttac atatcccaaa       60

tgtgatttaa caaaagaaaa tgcactttcc caaataacaa acctacaaac acccacaaac      120

aaattattca tcaaaatttg cagagaacta catgaaaatg gggaacctca tctccatatt      180

ctcatccaat tcgaaggaaa atacaattgt accaatcaac gattcttcga cctggtatcc      240

ccaaccaggt cagcacattt ccatccgaac attcagggag ctaaatcgag ctccgacgtc      300

aagtcctata tcgacaagga cggagatgtt cttgaatggg gtactttcca gatcgacgga      360

cgatctgcta ggggaggaca acagacagcc aacgacgctt acgcaaaggc aattaacgca      420

ggaagtaagt cgcaggctct tgatgtaatt aaagaattag cgcctagaga ttacgttcta      480

cattttcata atataaatag taatttagat aaggttttcc aggtgcctcc ggcaccttat      540

gtttctcctt ttttatcttc ttctttcgat caagttcctg atgaacttga acactgggtt      600

tccgagaacg tcatggatgc cgctgcgcgg                                       630


<210> 2
<211> 630
<212> DNA
<213> Artificial

<220>
<223> TYLCSV Rep-210 modified sequence

<220>
<221> CDS
<222> (1)..(630)

<400> 2









atg cct aga tcc gga agg ttt agc atc aaa gct aag aat tac ttc ttg       48
Met Pro Arg Ser Gly Arg Phe Ser Ile Lys Ala Lys Asn Tyr Phe Leu
1               5                   10                  15

aca tac ccc aag tgt gac tta act aag gag aat gca ttg tcc cag ata       96
Thr Tyr Pro Lys Cys Asp Leu Thr Lys Glu Asn Ala Leu Ser Gln Ile
            20                  25                  30

act aac ttg caa act ccc act aac aag ttg ttc att aag att tgt agg      144
Thr Asn Leu Gln Thr Pro Thr Asn Lys Leu Phe Ile Lys Ile Cys Arg
        35                  40                  45

gaa ctt cat gag aat gga gaa cca cat ctt cat atc ttg ata cag ttc      192
Glu Leu His Glu Asn Gly Glu Pro His Leu His Ile Leu Ile Gln Phe
    50                  55                  60

gaa ggc aag tat aac tgc acc aac caa cgt ttc ttt gac ctt gtg tcc      240
Glu Gly Lys Tyr Asn Cys Thr Asn Gln Arg Phe Phe Asp Leu Val Ser
65                  70                  75                  80

cct acc aga tca gcc cat ttt cat cca aac atc cag ggt gct aag tcg      288
Pro Thr Arg Ser Ala His Phe His Pro Asn Ile Gln Gly Ala Lys Ser
                85                  90                  95

agt tca gac gtg aag tca tac att gac aaa gac ggc gat gtg ctc gag      336
Ser Ser Asp Val Lys Ser Tyr Ile Asp Lys Asp Gly Asp Val Leu Glu
            100                 105                 110

tgg gga act ttt cag ata gac ggt cga tcg gct aga gga ggt cag caa      384
Trp Gly Thr Phe Gln Ile Asp Gly Arg Ser Ala Arg Gly Gly Gln Gln
        115                 120                 125

aca gct aac gat gca tac gct aag gct atc aac gct gga tcc aag tca      432
Thr Ala Asn Asp Ala Tyr Ala Lys Ala Ile Asn Ala Gly Ser Lys Ser
    130                 135                 140

cag gca ctt gac gta atc aaa gag tta gct cct agg gat tat gtt ctt      480
Gln Ala Leu Asp Val Ile Lys Glu Leu Ala Pro Arg Asp Tyr Val Leu
145                 150                 155                 160

cat ttc cat aac atc aac agc aat ttg gac aaa gtg ttc caa gtg cca      528
His Phe His Asn Ile Asn Ser Asn Leu Asp Lys Val Phe Gln Val Pro
                165                 170                 175

ccg gct cct tac gtt tca cct ttc tta agt tct tca ttt gat cag gtt      576
Pro Ala Pro Tyr Val Ser Pro Phe Leu Ser Ser Ser Phe Asp Gln Val
            180                 185                 190

cca gat gag ctt gag cat tgg gtg tcc gaa aac gtt atg gac gcc gca      624
Pro Asp Glu Leu Glu His Trp Val Ser Glu Asn Val Met Asp Ala Ala
        195                 200                 205

gcg cgt                                                              630
Ala Arg
    210


<210> 3
<211> 210
<212> PRT
<213> Artificial

<220>
<223> Synthetic Construct

<400> 3

Met Pro Arg Ser Gly Arg Phe Ser Ile Lys Ala Lys Asn Tyr Phe Leu
1               5                   10                  15

Thr Tyr Pro Lys Cys Asp Leu Thr Lys Glu Asn Ala Leu Ser Gln Ile
            20                  25                  30



480 



528 



624 



630 
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Thr Asn Leu Gin Thr Pro Thr Asn Lys Leu Phe lie Lys lie cys Arg 
35 40 45 

Glu Leu His Glu Asn Gly Glu Pro His Leu His lie Leu lie Gin Phe 
50 55 60 

Glu Gly Lys Tyr Asn Cys Thr Asn Gin Arg Phe Phe Asp Leu Val Ser 
65 70 75 80 

pro Thr Arg ser Ala His Phe His Pro Asn lie Gin Gly Ala Lys Ser 
85 90 95 

ser ser Asp Val Lys Ser Tyr lie Asp Lys Asp Gly Asp val Leu Glu 
100 105 110 

Trp Gly Thr Phe Gin lie Asp Gly Arg Ser Ala Arg Gly Gly Gin Gin 
115 120 125 

Thr Ala Asn Asp Ala Tyr Ala Lys Ala lie Asn Ala Gly Ser Lys Ser 
130 135 140 

Gin Ala Leu Asp val lie Lys Glu Leu Ala Pro Arg Asp Tyr Val Leu 
145 150 155 160 

His Phe His Asn lie Asn Ser Asn Leu Asp Lys Val Phe Gin Val Pro 
165 170 175 

Pro Ala Pro Tyr Val ser Pro Phe Leu ser ser Ser Phe Asp Gin Val 
180 185 190 

Pro Asp Glu Leu Glu His Trp Val ser Glu Asn Val Met Asp Ala Ala 
195 200 205 

Ala Arg 
210 

<210> 4 

<211> 630 

<212> DNA 

<213> Artificial 

<220> 

<223> TYLCSV Rep-210 modified sequence 
<220> 

<221> CDS 

<222> (1)..(630)

<400> 4

atg cct aga tcc gga agg ttt agc atc aaa gct aag aat tac ttc ttg 48
Met Pro Arg Ser Gly Arg Phe Ser Ile Lys Ala Lys Asn Tyr Phe Leu
1 5 10 15

aca tac ccc aag tgt gac tta act aag gag aat gca ttg tcc cag ata 96
Thr Tyr Pro Lys Cys Asp Leu Thr Lys Glu Asn Ala Leu Ser Gln Ile
20 25 30

act aac ttg caa act ccc act aac aag ttg ttc att aag att tgt agg 144
Thr Asn Leu Gln Thr Pro Thr Asn Lys Leu Phe Ile Lys Ile Cys Arg
35 40 45

gaa ctt cac gag aat gga gaa cca cat ctt cat atc ttg ata cag ttc 192
Glu Leu His Glu Asn Gly Glu Pro His Leu His Ile Leu Ile Gln Phe
50 55 60

gaa ggc aag tat aac tgc acc aac caa cgt ttc ttt gac ctt gtg tcc 240
Glu Gly Lys Tyr Asn Cys Thr Asn Gln Arg Phe Phe Asp Leu Val Ser
65 70 75 80

cct acc aga tca gcc cat ttt cat cca aac atc cag ggt gct aag tcg 288
Pro Thr Arg Ser Ala His Phe His Pro Asn Ile Gln Gly Ala Lys Ser
85 90 95

agt tca gac gtg aag tca tac att gac aaa gac ggg gat gtg ctc gag 336
Ser Ser Asp Val Lys Ser Tyr Ile Asp Lys Asp Gly Asp Val Leu Glu
100 105 110

tgg gga act ttt cag ata gac ggt cga tcg gct aga gga ggt cag caa 384
Trp Gly Thr Phe Gln Ile Asp Gly Arg Ser Ala Arg Gly Gly Gln Gln
115 120 125

aca gca aac gat gca tac gct aag gct atc aac gct gga tcc aag tca 432
Thr Ala Asn Asp Ala Tyr Ala Lys Ala Ile Asn Ala Gly Ser Lys Ser
130 135 140

cag gca ctt gac gta atc aaa gag tta gct cct agg gat tat gtt ctt 480
Gln Ala Leu Asp Val Ile Lys Glu Leu Ala Pro Arg Asp Tyr Val Leu
145 150 155 160

cat ttc cat aac atc aac agc aat ttg gac aaa gtg ttc caa gtg cca 528
His Phe His Asn Ile Asn Ser Asn Leu Asp Lys Val Phe Gln Val Pro
165 170 175

ccg gct cct tac gtt tca cct ttc tta agt tct tca ttt gat cag gtt 576
Pro Ala Pro Tyr Val Ser Pro Phe Leu Ser Ser Ser Phe Asp Gln Val
180 185 190

cca gat gag ctt gag cat tgg gtg tct gaa aac gtt atg gac gcc gca 624
Pro Asp Glu Leu Glu His Trp Val Ser Glu Asn Val Met Asp Ala Ala
195 200 205

gcc cgt 630
Ala Arg
210

<210> 5 

<211> 210 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct

<400> 5

Met Pro Arg Ser Gly Arg Phe Ser Ile Lys Ala Lys Asn Tyr Phe Leu
1 5 10 15

Thr Tyr Pro Lys Cys Asp Leu Thr Lys Glu Asn Ala Leu Ser Gln Ile
20 25 30

Thr Asn Leu Gln Thr Pro Thr Asn Lys Leu Phe Ile Lys Ile Cys Arg
35 40 45

Glu Leu His Glu Asn Gly Glu Pro His Leu His Ile Leu Ile Gln Phe
50 55 60

Glu Gly Lys Tyr Asn Cys Thr Asn Gln Arg Phe Phe Asp Leu Val Ser
65 70 75 80

Pro Thr Arg Ser Ala His Phe His Pro Asn Ile Gln Gly Ala Lys Ser
85 90 95

Ser Ser Asp Val Lys Ser Tyr Ile Asp Lys Asp Gly Asp Val Leu Glu
100 105 110

Trp Gly Thr Phe Gln Ile Asp Gly Arg Ser Ala Arg Gly Gly Gln Gln
115 120 125

Thr Ala Asn Asp Ala Tyr Ala Lys Ala Ile Asn Ala Gly Ser Lys Ser
130 135 140

Gln Ala Leu Asp Val Ile Lys Glu Leu Ala Pro Arg Asp Tyr Val Leu
145 150 155 160

His Phe His Asn Ile Asn Ser Asn Leu Asp Lys Val Phe Gln Val Pro
165 170 175

Pro Ala Pro Tyr Val Ser Pro Phe Leu Ser Ser Ser Phe Asp Gln Val
180 185 190

Pro Asp Glu Leu Glu His Trp Val Ser Glu Asn Val Met Asp Ala Ala
195 200 205

Ala Arg
210

<210> 6 

<211> 774 

<212> DNA 

<213> Artificial 

<220> 

<223> TYLCSV Coat Protein modified sequence

<220>
<221> CDS
<222> (1)..(774)

<400> 6

atg cca aag aga act ggt gat att cta atc tca act ccc gtg tct aag 48
Met Pro Lys Arg Thr Gly Asp Ile Leu Ile Ser Thr Pro Val Ser Lys
1 5 10 15

gtg cgt agg aga ctt aac ttt gac tct ccg tac acc tct cgt gca gct 96
Val Arg Arg Arg Leu Asn Phe Asp Ser Pro Tyr Thr Ser Arg Ala Ala
20 25 30

gct ccc aca gtc cag ggc att aag agg cga tct tgg aca tac aga cct 144
Ala Pro Thr Val Gln Gly Ile Lys Arg Arg Ser Trp Thr Tyr Arg Pro
35 40 45

atg tac agg aaa ccg agg atg tat agg atg tat cgt agc cca gat gtg 192
Met Tyr Arg Lys Pro Arg Met Tyr Arg Met Tyr Arg Ser Pro Asp Val
50 55 60

cct cct ggt tgc gaa gga ccc tgc aag gtg caa tcg tat gag caa cgt 240
Pro Pro Gly Cys Glu Gly Pro Cys Lys Val Gln Ser Tyr Glu Gln Arg
65 70 75 80

gac gat gtg aag cac acc gga gtt gtt cgt tgc gtt tct gat gtg act 288
Asp Asp Val Lys His Thr Gly Val Val Arg Cys Val Ser Asp Val Thr
85 90 95

aga ggt tca ggt atc act cac agg gtg gga aag cgt ttc tgt att aag 336
Arg Gly Ser Gly Ile Thr His Arg Val Gly Lys Arg Phe Cys Ile Lys
100 105 110

tct att tac ata ttg ggt aag atc tgg atg gac gag aat atc aag aaa 384
Ser Ile Tyr Ile Leu Gly Lys Ile Trp Met Asp Glu Asn Ile Lys Lys
115 120 125

cag aat cac act aat cag gtt atg ttc ttt ctt gtg cga gat cga aga 432
Gln Asn His Thr Asn Gln Val Met Phe Phe Leu Val Arg Asp Arg Arg
130 135 140

cca tac gga acc agc cca atg gac ttc ggc cag gtg ttt aat atg ttc 480
Pro Tyr Gly Thr Ser Pro Met Asp Phe Gly Gln Val Phe Asn Met Phe
145 150 155 160

gat aac gag cca tct act gca act gtg aaa aat gat ttg cgt gat aga 528
Asp Asn Glu Pro Ser Thr Ala Thr Val Lys Asn Asp Leu Arg Asp Arg
165 170 175

tat cag gtg atg aga aag ttc cat gca acg gtg gtt ggt ggt cct tct 576
Tyr Gln Val Met Arg Lys Phe His Ala Thr Val Val Gly Gly Pro Ser
180 185 190

gga atg aaa gag caa tgt ctt ctg aaa aga ttc ttt aag atc aac act 624
Gly Met Lys Glu Gln Cys Leu Leu Lys Arg Phe Phe Lys Ile Asn Thr
195 200 205

cat gtc gtc tat aac cac cag gag caa gcg aaa tat gag aat cac act 672
His Val Val Tyr Asn His Gln Glu Gln Ala Lys Tyr Glu Asn His Thr
210 215 220

gaa aat gct ttg ttg tta tac atg gcc tgt acc cac gca tct aat cca 720
Glu Asn Ala Leu Leu Leu Tyr Met Ala Cys Thr His Ala Ser Asn Pro
225 230 235 240

gtt tac gca acg ctt aag atc cgt atc tat ttc tat gac gct gtg aca 768
Val Tyr Ala Thr Leu Lys Ile Arg Ile Tyr Phe Tyr Asp Ala Val Thr
245 250 255

aac tag 774
Asn



<210> 7
<211> 257
<212> PRT
<213> Artificial

<220>
<223> Synthetic Construct

<400> 7

Met Pro Lys Arg Thr Gly Asp Ile Leu Ile Ser Thr Pro Val Ser Lys
1 5 10 15

Val Arg Arg Arg Leu Asn Phe Asp Ser Pro Tyr Thr Ser Arg Ala Ala
20 25 30

Ala Pro Thr Val Gln Gly Ile Lys Arg Arg Ser Trp Thr Tyr Arg Pro
35 40 45

Met Tyr Arg Lys Pro Arg Met Tyr Arg Met Tyr Arg Ser Pro Asp Val
50 55 60

Pro Pro Gly Cys Glu Gly Pro Cys Lys Val Gln Ser Tyr Glu Gln Arg
65 70 75 80

Asp Asp Val Lys His Thr Gly Val Val Arg Cys Val Ser Asp Val Thr
85 90 95

Arg Gly Ser Gly Ile Thr His Arg Val Gly Lys Arg Phe Cys Ile Lys
100 105 110

Ser Ile Tyr Ile Leu Gly Lys Ile Trp Met Asp Glu Asn Ile Lys Lys
115 120 125

Gln Asn His Thr Asn Gln Val Met Phe Phe Leu Val Arg Asp Arg Arg
130 135 140

Pro Tyr Gly Thr Ser Pro Met Asp Phe Gly Gln Val Phe Asn Met Phe
145 150 155 160

Asp Asn Glu Pro Ser Thr Ala Thr Val Lys Asn Asp Leu Arg Asp Arg
165 170 175

Tyr Gln Val Met Arg Lys Phe His Ala Thr Val Val Gly Gly Pro Ser
180 185 190

Gly Met Lys Glu Gln Cys Leu Leu Lys Arg Phe Phe Lys Ile Asn Thr
195 200 205

His Val Val Tyr Asn His Gln Glu Gln Ala Lys Tyr Glu Asn His Thr
210 215 220

Glu Asn Ala Leu Leu Leu Tyr Met Ala Cys Thr His Ala Ser Asn Pro
225 230 235 240

Val Tyr Ala Thr Leu Lys Ile Arg Ile Tyr Phe Tyr Asp Ala Val Thr
245 250 255

Asn



<210> 8 

<211> 447 

<212> DNA 

<213> Artificial 

<220> 

<223> TYLCSV Rep 130 sequence

<220>
<221> CDS
<222> (51)..(443)

<220>
<221> misc_feature
<222> (231)..(231)
<223> Point mutation from C (Rep-210 wild-type) to T

<220>
<221> misc_feature
<222> (233)..(233)
<223> Point mutation from C (Rep-210 wild-type) to G

<400> 8

ggatccccct ggatactttg agtgtccccc gattcagaac gacagcaaaa atg cca 56
Met Pro
1

aga tca ggt cgt ttt agt atc aag gct aaa aat tat ttc ctt aca tat 104
Arg Ser Gly Arg Phe Ser Ile Lys Ala Lys Asn Tyr Phe Leu Thr Tyr
5 10 15

ccc aaa tgt gat tta aca aaa gaa aat gca ctt tcc caa ata aca aac 152
Pro Lys Cys Asp Leu Thr Lys Glu Asn Ala Leu Ser Gln Ile Thr Asn
20 25 30

cta caa aca ccc aca aac aaa tta ttc atc aaa att tgc aga gaa cta 200
Leu Gln Thr Pro Thr Asn Lys Leu Phe Ile Lys Ile Cys Arg Glu Leu
35 40 45 50

cat gaa aat ggg gaa cct cat ctc cat att ttg atc caa ttc gaa gga 248
His Glu Asn Gly Glu Pro His Leu His Ile Leu Ile Gln Phe Glu Gly
55 60 65

aaa tac aat tgt acc aat caa cga ttc ttc gac ctg gta tcc cca acc 296
Lys Tyr Asn Cys Thr Asn Gln Arg Phe Phe Asp Leu Val Ser Pro Thr
70 75 80

agg tca gca cat ttc cat ccg aac att cag gga gct aaa tcg agc tcc 344
Arg Ser Ala His Phe His Pro Asn Ile Gln Gly Ala Lys Ser Ser Ser
85 90 95

gac gtc aag tcc tat atc gac aag gac gga gat gtt ctt gaa tgg ggt 392
Asp Val Lys Ser Tyr Ile Asp Lys Asp Gly Asp Val Leu Glu Trp Gly
100 105 110

act ttc cag atc gac gga cga tct gct agg gga gga caa cag aca gcc 440
Thr Phe Gln Ile Asp Gly Arg Ser Ala Arg Gly Gly Gln Gln Thr Ala
115 120 125 130

tga attc 447

<210> 9 

<211> 130 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 9 

Met Pro Arg Ser Gly Arg Phe Ser Ile Lys Ala Lys Asn Tyr Phe Leu
1 5 10 15

Thr Tyr Pro Lys Cys Asp Leu Thr Lys Glu Asn Ala Leu Ser Gln Ile
20 25 30

Thr Asn Leu Gln Thr Pro Thr Asn Lys Leu Phe Ile Lys Ile Cys Arg
35 40 45

Glu Leu His Glu Asn Gly Glu Pro His Leu His Ile Leu Ile Gln Phe
50 55 60

Glu Gly Lys Tyr Asn Cys Thr Asn Gln Arg Phe Phe Asp Leu Val Ser
65 70 75 80

Pro Thr Arg Ser Ala His Phe His Pro Asn Ile Gln Gly Ala Lys Ser
85 90 95

Ser Ser Asp Val Lys Ser Tyr Ile Asp Lys Asp Gly Asp Val Leu Glu
100 105 110

Trp Gly Thr Phe Gln Ile Asp Gly Arg Ser Ala Arg Gly Gly Gln Gln
115 120 125

Thr Ala
130

<210> 10 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 
<220> 

<221> misc_feature
<222> (1)..(30)
<223> Primer for PCR C4 mutagenesis

<400> 10 

ctcatctcca tattttgatc caattcgaag 30 



<210> 11
<211> 30
<212> DNA
<213> Artificial

<220>
<223> Primer

<220>
<221> misc_feature
<222> (1)..(30)
<223> Primer for PCR C4 mutagenesis

<400> 11

cttcgaattg gatcaaaata tggagatgag 30

<210> 12 
<211> 774 
<212> DNA 

<213> Geminivirus TYLCSV

<400> 12

atgccgaagc gaaccggcga tatactaatt tcaacgcccg tctcgaaggt tcgtcgaaga 60

ctgaacttcg acagcccgta taccagccgt gctgctgccc ccactgtcca aggcatcaag 120

cgtcgatcat ggacttacag gcccatgtat cgaaagccgc ggatgtacag aatgtacaga 180

agccctgatg tacctccggg ttgtgaaggt ccctgtaaag tgcagtcgta cgagcagcgt 240

gatgacgtca agcataccgg tgttgtgcgt tgtgttagtg atgtaactag gggttctggt 300

attactcata gagttggtaa acgtttttgt atcaagtcaa tttatatatt aggaaagatt 360

tggatggatg aaaacataaa aaaacaaaat catactaacc aagtgatgtt tttccttgtt 420

cgagaccgaa ggccttatgg aactagtcct atggattttg gtcaagtttt taacatgttt 480

gataatgaac ccagtactgc tacggtgaag aacgacttac gggataggta tcaagtaatg 540

aggaagtttc atgctacggt tgttggaggt ccgtcaggga tgaaggagca gtgtttgctg 600

aagagatttt ttaaaattaa tacccatgta gtttataatc accaagagca ggcgaagtat 660

gaaaatcata ctgagaatgc cttgttattg tatatggctt gtactcatgc ttctaaccca 720

gtgtacgcta cgttgaaaat acgtatttat ttttatgatg ctgtaacaaa ttaa 774
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